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WO 99/64564 PCTYUS99/12697 
GENES CODING FOR TOMATO (3-GALACTOSIDASE 
POLYPEPTIDES 



Field of the Invention 

The present invention relates to a family of novel plant genes encoding 
polypeptides characterized by their ability to hydrolyze terminal non-reducing 
p-D-galactosyl residues from P-D-galactosides. More specifically, a 
polynucleotide sequence derived from a cDNA clone designated pZBG2-l-4 
(referred to in U.S. Provisional Appln. No. 60/088,805 as pTompgal 4), which 
encodes a specific plant polypeptide named P-galactosidase n, is provided. 
Also provided are cDNA clones encoding six other homologous polypeptides, 
methods of using these cDNA clones for producing P-D-galactoside 
polypeptides of the invention, and methods of modifying fruit quality by 
employment of a polynucleotide or polypeptide of the present invention. 

Background of the Invention 
The most conspicuous and important processes related to post-harvest 
quality of climacteric fruit are the changes in texture, color, taste, and aroma 
which occur during ripening. Because of the critical relationship that 
deleterious changes in texture have to quality and post-harvest shelf-life, 
emphasis has been placed on studying the mechanisms involved in the loss of 
firmness that occurs'during tomato fruit ripening. Although fruit softening 
may involve changes in turgor pressure, anatomical characteristics and cell 
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wall integrity, it is generally assumed that cell wall disassembly leading to a 
loss of wall integrity is a critical feature. The most apparent changes, in terms 
of composition and size, occur in the pectic fraction of the cell wall (see 
references in Seymour and Gross, 1996). 

Changes known to occur in the pectic fraction of the cell wall during fruit 
ripening include increased solubility, depolymerization, de-esterification and a 
significant net loss of neutral sugar containing side chains (Huber, 1983; 
Fischer and Bennett, 1991; Seymour and Gross, 1996). The best characterized 
pectin-modifying enzymes are polygalacturonase (endo-otl-»4-D-galacturonan 
hydrolase; E.C. 3.2 J. 15; PG) and pectin methylesterase (E.C. 3.1.1.11; PME). 
Although PG and PME are relatively abundant and have substantia] activity 
during tomato fruit ripening, softening still occurs, albeit with a slight delay, in 
fruit where PG (Smith et al 1988, 1990) or PME (Tieman et al 1992; Hall et 
al 1993) gene expression and enzyme activity was significantly down- 
regulated in transgenic plants. Moreover, over-expression of PG in non- 
ripening mutant rin tomato fruit did not result in softening even though 
depolymerization and solubilization of pectin was evident (Giovannoni et al, 
1989). 

Among the other known pectin modifications that occur during fruit 
development, one of the best characterized is the significant net loss of 
galactosyl residues which occurs in the cell walls of many ripening fruit (Gross 
and Sams, 1984; Seymour and Gross, 1996). Although some loss of galactosyl 
residues could result indirectly from the action of PG, [J-galactosidase (exo- 
P(l-*4)-D-galactopyranoside; E.C. 3.2.1.23) is the only enzyme identified in 
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higher plants capable of directly cleaving (J(l-*4)galactan bonds, and probably 
plays a role in galactan sidechain loss (DeVeau et al, 1993; Carey et al, 1995; 
Carrington and Pressey, 1996). No endo-acting galactanase has yet been 
identified in higher plants. The view that (J-galactosidase is active in releasing 
galactosyl residues from the cell wall during ripening is supported by the 
dramatic increase in free galactose, a product of (3-galactosidase activity 
(Gross, 1984) and a concomitant increase in activity of a particular enzyme, 
designated [3-galactosidase n, in tomatoes during ripening (Carey et al, 1995). 
(i-galactosidase activity is thought to be important in cell wall metabolism 
(Carey et al., 1995). (5-Galactosidases are generally assayed using artificial 
substrates such as p-nitrophenyl-p-D-galactopyranoside (PNP), 4- 
methylumbelliferyl-P-D-galactopyranoside and 5-bromo-4-chloro-3-indoxyl- 
P-D-galactopyranoside (X-GAL). However, it is clear that (J-galactosidase H 
is also active against natural substrates, i.e., p (l-*4)galactan (Carey et al, 
1995; Carrington and Pressey, 1996; Pressey, 1983). p-Galactosidase proteins 
have been purified and characterized in a number of other fruits including 
kiwifruits (Ross et al, 1993), coffee (Golden et al, 1993), persimmon (Kang 
et al, 1994), and apple (Ross et al, 1994). 

Carey et al (1995) were able to purify three previously identified [5- 
galactosidases from ripening tomato fruit (Pressey, 1983), but only one (|3- 
galactosidase II) was active against P(l-»4)galactan. Even though they were 
able to identify putative (3-galactosidase cDNA clones, none of the cDNA's 
deduced amino acid sequences matched the amino terminal sequence of the (}- 
galactosidase II protein. Although (3-galactosidase II, a protein present in 
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tomato (Lycopersicon esculentum Mill.) fruit during ripening and capable of 
degrading tomato fruit galactan has been purified, cloning of the 
corresponding gene has been elusive. 

The modification of plant gene expression has been achieved by several 
methods. The molecular biologist can choose from a range of known methods 
to decrease or increase gene expression or to alter the spatial or temporal 
expression of a particular gene. For example, the expression of either specific 
antisense RNA or partial (truncated) sense RNA has been utilized to reduce 
the expression of various target genes in plants (as reviewed by Bird and Ray, 
1991 , Biotechnology and Genetic-Engineering Reviews 9:207-227). These 
techniques involve the incorporation into the genome of the plant of a 
synthetic gene designed to express either antisense or sense RNA. They have 
been successfully used to down-regulate the expression of a range of 
individual genes involved in the development and ripening of tomato fruit 
(Gray et al, 1992, Plant Molecular Biology, i9:69-87). Methods to increase the 
expression of a target gene have also been developed. For example, additional 
genes designed to express RNA containing the complete coding region of the 
target gene may be incorporated into the genome of the plant to "over-express" 
the gene product. Various other methods to modify gene expression are 
known; for example, the use of alternative regulatory sequences. The 
complete disclosure of each of the references cited above is fully incorporated 
herein by reference. 

The need therefore exists to clone a gene for (5-galactosidase II and related 
polypeptides, and using known methods of modification of plant gene 
expression, thereby to provide methods for modifying quality of fruits, 
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particularly by modifying the cell wall, thereby directly affecting the ripening 
of the fruit. 



Summary of the Invention 

The present invention is based on the discovery of novel DNA sequences 
derived from cDNA clones from a family of genes encoding (3-galactosidases. 
The phylogenic tree based on the shared amino acid sequence identities for the 
DNA sequences of the present invention is shown in Figure 1 A3- Five cDNA 
and two RT-PCR clones, designated herein as TBG1, TBG2, TBG3, TBG4, 
TBG5, TBG6, and TBG7 and having the nucleic acid sequences designated 
SEQ ID NOs 1-7, respectively as shown in Figure 2, were identified which had 
a high degree of shared sequence identity to other known (3-galactosidases. 
The corresponding amino acid sequences are designated herein as SEQ ID 
NOs 8-16, respectively and are shown in Figure 2 and 3. The nucleotide 
sequences for SEQ ID NOs 1-7 are recorded in Gen Bank with the following 
respective Accessions Numbers: 



SEQ ID NO: 1 


TGB1 


AF023847 


deposit Sept 10, 1997 


SEQ ED NO:2 


TGB2 


AF 154420 


deposited May 19, 1999 


SEQ ID NO: 3 


TGB3 


AF 154421 


deposited May 20, 1999 


SEQ ID NO:4 


TGB4 


AF020390 


deposited Aug 21, 1997 


SEQ ID NO:5 


TGB5 


AF154423 


deposited May 20, 1999 


SEQ ID NO:6 


TGB6 


AF 154424 


deposited May 20, 1999 


SEQ ID NO: 7 


TGB7 


AF1 54422 


deposited May 20, 1999 
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Throughout the following discussion, wherever TBG4 is indicated in the 
description of the invention, it is to be understood that TBG1-3 and 5-7 are 
also to be included in that description, unless otherwise indicated. 

A method of providing a DNA sequence of the invention, either by 
cloning a cDNA (for instance, pZBG2-l-4) that codes for a protein of the 
present invention, such as (3-galactosidase II, or by deriving the DNA sequence 
from genomic DNA, or by synthesis of a DNA sequence ab initio using the 
cDNA sequence as a guide is also provided. 

A method for modifying cell wall metabolism which involves modifying 
the activity of at least one galactosidase, and thus modifying the quality of the 
fruit is also provided. 

Also provided by the present invention is a DNA construct including 
some or all of an exemplary ^-galactosidase DNA sequence under control of a 
transcriptional initiation region operative in plants, so that the construct can 
generate RNA in plant cells. 

Also discovered is an enhancer/promoter associated with expression of 
the genes encoding P-galactosidase. 

The present invention also relates to recombinant vectors, which include 
the isolated nucleic acid molecules of the present invention, and to host cells 
containing the recombinant vectors, as well as to methods of making such 
vectors and host cells and for using them for production of fj-galactosidase 
polypeptides or peptides by recombinant techniques. 
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The present invention also provides plant cells containing DNA 
constructs of the present invention; plants derived therefrom having modified 
P-galactosidase gene expression; and seeds produced from such plants. 

The p-galactosidase II protein of the present invention has demonstrated 
enzyme activity in cell wall disassembly leading to loss of tissue integrity and 
fruit softening. The p-galactosidase II protein also may be involved in cell 
wall turnover, which could be involved in cell extension and/or expansion and 
therefore plant growth and development. 

By hydrolyzing galactose from the cell wall, the enzyme may allow 
ripening to commence and/or progress, since galactose may be involved in 
stimulating ethylene production alone or in conjunction with unconjugated N- 
glycans. 

The [3-galactosidase of the invention may be involved in conversion of 
chloroplasts (green - chlorophyll) to chromoplasts (red - lycopene) during 
fruit ripening by degrading chloroplast membrane galactolipids. 

The family of genes represented by the nucleotide sequences shown in 
Figure 2 is expected to code for a group of similar enzymes with the same 
type of hydrolytic activity but with different tissue and/or substrate 
specificity's or cellular compartmentation profiles. 

The p-galactosidase II protein of the present invention as well as other 
proteins encoded in the nucleotide sequences shown in Figure 2 may be used 
for preparation of pectin and other cell wall derived polymers with lowered 
galactosyl content for use in biofilms and solutions (for example in 
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clarification of fruit juices) requiring lower or higher cross-linking or 
viscomertric properties. 

The present invention also provides (3-galactosidase enzymes for use as 
components of enzyme mixtures for protoplast isolation. 

Brief Description of the Figures 

Figure 1A and IB shows a phylogenic tree based on shared amino acid 
sequence identity among tomato [3-galactosidase clones TGB1-7 and other 
known plant (J-galactosidase polypeptides. 

Figure 2 shows cDNA sequences [SEQ ID NOs: 1-7, respectively] for the 
seven (3-galactosidase genes of the invention: TGB1, TGB2, TGB3, TGB4, 
TGB 5, TGB6, TGB7. 

Figure 3 shows multiple sequence alignment of the deduced amino acid 
sequences of tomato fruit for cDNA clones TGB 1 , TGB2, TGB3, TGB4, 
TGB 5, TGB 6 and TGB 7 [SEQ ID NOs: 8-16, respectively] and various plant 
P-galactosidase cDNA clones. 

Figure 4 shows autoradiograph of northern blot analysis of TBG expression in 
various plant tissues (flowers, leaves, roots and stems). 

Figure 5 shows Autoradiograph of northern blot analysis of TBG expression 
in fruit tissues at different stages of development. 
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Figure 6 shows autogradiograph of northern blot analysis of TBG expression 
in fruit tissues (mature green or turning stage fruit peel, outer pericarp, inner 
paricarp and 1 ocular). 

Figure 7 shows autoradiograph of northern blot analysis of TBG expression in 
normal and mutant fruit tissues. 

Figure 8 shows autoradiograph of northern blot analysis of TBG expression in 
response to ethylene treatment of mature green fruit tissues. 

Figure 9 shows Western blot analysis of TBG4 expression by yeast. 

Figure 10 shows detection of (3-galactosidase activity from pZBG2-l-4 
expression in E. coli. 

Figure 11 A - E (1-4) shows the comparative results of texture measurements 
for fruit from tomato plants containing antisense constructs to suppress TBG4 
mRNA and fruit from the parental line. 

Figures 12A - B show Northern blot analysis of TBG4 expression in 
transgenic fruit containing TBG4 antisense construct. 

Figure 13 shows a Binary construct used to transform plants and express 
TBG4 (pZBG2-l-4) in the antisense orientation. 
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Detailed Description 

The following detailed description is directed to a preferred 
embodiment of the present invention and is intended as illustrative of each of 
other DNA sequences of the present invention. 

The present invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding [3-galactosidase polypeptides, 
particularly a p-galactosidase II polypeptide having the amino acid sequence 
shown in Figure 2. The DNA sequence of the exemplary fi-galactosidase II 
cDNA clone of the invention, which was determined from a cDNA clone, 
pZBG2-l-4, encoding (i-galactosidase II, is recorded in GenBank as Accession 
Number AF020390. Not all p-galactosidases possess in vitro activity against 
extracted cell wall material via the release of galactose from wall polymers 
containing P(l-*4)-D-galactan. The polypeptide expressed from the exemplary 
p-galactosidase H clone, pZBG2-l-4, has been shown to exhibit 
P-galactosidase activity and exogalactinase activity. 

The exemplary P-galactosidase II protein of the present invention, as 
shown in Figure 2, shares sequence homology with the amino acid sequence 
deduced from P-galactosidase cDNA clones of TBG2-7 and cDNA clones of 
the fruits of asparagus (accession number P45582), apple (accession number 
P48981), and carnation (accession number Q00662), as well as with p- 
galactosidase cDN A clones of a previously published sequence of a tomato p- 
galactosidase cDNA clone designated pTompgall (accession number P48980) 
isolated from ripe *Ailsa Craig' fruit (Carey et ai, 1995). The ORF of the 
clone TBG1 disclosed herein by the inventors (accession number AF023847) 
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is nearly identical to the cDNA previously described by Carey et al. As shown 
in Figure 2, the shared deduced sequence identity is high among all the 
published plant p-galactosidases of the seven clones (TBG1-7) and the other 
plant P-galactosidases. 

BLAST searches of the database also indicated significant shared 
sequence identity between domains of the plant P-galactosidases and 
mammalian and fungal p-galactosidases, however little share sequence identity 
was detected with bacterial P-galactosidases. 

As shown in Figure 1, the shared amino acid identity of TBG1 and 
TBG3 was high. TBG4 was also very similar to both TBG1 and 3. The amino 
acid sequences of TBG2 and 7 were unique because several regions of amino 
acid insertions appear throughout their sequence (Figure 3). 



Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DN A molecule herein were determined using a PCR-based 
dideoxynucleotide terminator protocol and an ABI automated DNA sequencer 
(such as the Model 373 from Applied Biosystems, Inc., Foster City, CA), and 
all amino acid sequences of polypeptides encoded by DNA molecules 
determined herein were predicted by translation of a DNA sequence 
determined as above. Therefore, as is known in the art for any DNA sequence 
determined by this automated approach, any nucleotide sequence determined 
herein may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90% identical, more typically at least 

) 
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about 95% to at least about 99.9% identical to the actual nucleotide sequence 
of the sequenced DNA molecule. The actual sequence can be more precisely 
determined by other approaches including manual DNA sequencing methods 
well known in the art. As is also known in the art, a single insertion or 

5 deletion in a determined nucleotide sequence compared to the actual sequence 

will cause a frame shift in translation of the nucleotide sequence such that the 
predicted amino acid sequence encoded by a determined nucleotide sequence 
will be completely different from the amino acid sequence actually encoded by 
the sequenced DNA molecule, beginning at the point of such an insertion or 

10 deletion. 

By "nucleotide sequence" of a nucleic acid molecule or polynucleotide 
is intended, for a DNA molecule or polynucleotide, a sequence of 
deoxyribonucleotides, and for an RNA molecule or polynucleotide, the 
corresponding sequence of ribonucleotides (A, G, C and U), where each 

15 thymidine deoxyribonucleotide (T) in the specified deoxyribonucleotide 

sequence is replaced by the ribonucleotide uridine (U). 

Using the information provided herein, such as the exemplary nucleotide 
sequence shown in Figure 2 [SEQ ID NO: 4], a nucleic acid molecule of the 
present invention encoding a [}-galactosidase II polypeptide may be obtained 

20 using standard cloning and screening procedures, such as those for cloning 

cDNAs using mRNA as starting material. Illustrative of the invention, the 
nucleic acid molecule described in Figure 2 [SEQ ID NO: 4] was discovered 
in a cDNA library derived from breaker, turning and pink fruit pericarp from 
'Rutgers' tomato plants. 
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The complete sequence of the cDNA insert of pZBG2-l-4 is accessible in 
the GenBank (no. AF020390) and is provided in Figure 2 [SEQ ID NO: 4]. 
The cDNA insert is 2532 nucleotides (nt) long and contains a single, long open 
reading frame (ORF) predicted to start with the first in-frame ATG at nt 64 
and end with TAA at nt 2238. This ORF codes for a 79 kD protein 724 amino 
acids long. The deduced amino acid sequence of pZBG2-l-4 shared 
significant amino acid identity to all published plant P-galactosidase sequences 
in the database (Figure 1 A,B). When the entire ORF of each (3-galactosidase 
gene was compared to pZBG2-l-4, the shared sequence identity was about 
64% for tomato pTom(5gal 1 (P48980), about 67.6% for apple (P48981), about 
63% for asparagus (P45582) and about 55% for carnation (Q00662). As one 
of ordinary skill would appreciate, due to the possibilities of sequencing errors 
discussed above, the actual complete (5-galactosidase II polypeptide encoded 
by the deposited cDNA, which comprises about 724 amino acids, may be 
somewhat longer or shorter. More generally, the actual open reading frame 
may be anywhere in the range of ±20 amino acids, more likely in the range of 
±10 amino acids, of that predicted from either the first methionine codon from 
the N-terminus shown in Figure 2 [SEQ ID NO: 4]. In any event, as discussed 
further below, the invention further provides polypeptides having various 
residues deleted from the N-terminus of the complete polypeptide, including 
polypeptides lacking one or more amino acids from the N-terminus of the (}- 
galactosidase II polypeptide described herein. 
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Leader and Mature Sequences 

Analysis of the deduced amino acid sequence of pZBG2-l-4 suggested 
a high probability for secretion based on the presence of a hydrophobic leader 
sequence, a leader sequence cleavage site and three possible N-glycosylation 
sites. The programs PSORT V6.4 (Nakai and Kanehisa, 1992, incorporated 
herein by reference) and SignalP VI. 1 (Nielsen et al., 1997, incorporated 
herein by reference), were used to predict that the ORF contains a hydrophobic 
leader sequence that would be cleaved between the alanine and serine residues 
at positions 23 and 24 respectively, and that the mature polypeptide has an 
extracellular location. The mature polypeptide contains three possible N- 
glycosylation sites at asparagine numbers 282, 459 and 713, however the 
asparagine at position 713 is unlikely to be glycosylated due to the proline at 
position 714. The predicted molecular mass of the unglycosylated mature 
polypeptide was 75 kD with a pi of 8.9. 

Accordingly, the amino acid sequence of the complete p-galactosidase 
II protein of the invention includes a leader sequence and a mature protein, as 
shown in Figure 3 [SEQ ID NO: 4]. More in particular, the present invention 
provides nucleic acid molecules encoding a mature form of the p-galactosidase 
II protein. Thus, according to the signal hypothesis, secreted proteins have a 
signal or secretory leader sequence which is cleaved from the complete 
polypeptide to produce a secreted "mature" form of the protein. In some cases, 
cleavage of a secreted protein is not entirely uniform, which results in two or 
more mature species of the protein. Further, it has long been known that the 
cleavage specificity of a secreted protein is ultimately determined by the 
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primary structure of the complete protein, that is, it is inherent in the amino 
acid sequence of the polypeptide. Therefore, the present invention provides a 
nucleotide sequence encoding the mature p-galactosidase II polypeptide 
having the amino acid sequence encoded by the cDNA shown in Figure 2 
[SEQ ID NO: 4] and provided in GenBank (Accession No. AF20390). By the 
"mature (5-galactosidase II polypeptide having the amino acid sequence 
encoded by the cDNA clone shown in Figure 2 [SEQ ID NO: 4] is meant the 
mature form(s) of the (3-galactosidase II protein produced by expression in a 
plant cell of the complete open reading frame encoded by the cDNA sequence 
of the clone shown in Figure 2 [SEQ ID NO: 4] and provided in GenBank 
(Accession No. AF20390). 

The exemplary p-galactosidase II cDNA of the present invention 
(TBG4) has been expressed in E. coli strain XLI blue MR (lacZ) (Stratagene, 
La Jolla, CA), as described hereinbelow (see Example). 

Analysis of the deduced amino acid sequence of cDNA clones 
representing the other p-galactosidase genes of the invention also revealed 
open reading frames and, in some cases, suggested a high probability for 
secretion of the encoded proteins. All the full-length cDNA clones were 
predicted to have a signal sequence (Fig. 2). Using the two prediction 
programs SignalP and PSORT, TBG4 was predicted to be secreted by both 
programs. TBG1, 2 and 3 were predicted to have cleavable signal sequences 
by SignalP, but uncleavable signal sequences by PSORT. TBG7 was suggested 
to be targeted to the chloroplast by PSORT. Particular observations for each 
of the seven clones are as follows, based on the presence of a hydrophobic 



WO 99/64564 PCT/US99/12697 

16 

leader predicted by the programs PSORT V6. and SignalP VI . 1 : TBG1 : 
initiation codon at 306 [SEQ ID NO: 1], ORF = 835 amino acids [SEQ ID 
NO: 8], signal sequence at 1-24; TBG2: initiation codon not determined [SEQ 
ID NO: 2], ORF = 888 amino acids [SEQ ID NO: 9], signal sequence at 1-25; 
TBG3: initiation codon at 32 [SEQ ID NO: 3], ORF = 838 amino acids [SEQ 
ID NO: 10], signal sequence at 1-22; TBG5: initiation codon not determined 
[SEQ ID NO:5], ORF = 251 amino acids [SEQ ID NO: 12], signal sequence 
not determined; TBG6: initiation codon not determined [SEQ ID NO:6], ORF 
= 248 amino acids [SEQ ID NO: 13], signal sequence not determined; TBG7: 
initiation codon at 104 [SEQ ID NO: 7], ORF = 870 amino acids [SEQ ID 
NO: 14], signal sequence at 1-35. 

The deduced amino acid sequences of the seven clones was also 
subjected to analysis using the program DNAsis and the predictions for 
molecular mass, cellular targeting, pi and potential N-linked glycosylation 
sites are summarized in Table L 
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Table I. Tomato (3-galactosidase (TBG) cDNA sequence data. Fiv 
full-length and 2 partial-length cDNAs were cloned and sequenced. 
The DNA and deduced amino acid sequence data is presented 
below 



CLONE 


mRNA(kb) 


kD 


Pi 


N-LINK 


TARGET 


TBG1 


3.2 


90.8 


6.2 


2 


ER/OUT 


TBG2 


3.0 


97.0 


6.2 


6 


PM 


TBG3 


2.8 


90.5 


8.2 


1 


ER/OUT 


TBG4 


2.6 


77.9 


8.9 


3 


OUT 


TBG5 


~3 










TBG6 


~3 










TBG7 


3.0 


93.3 


8.0 


6 


CHLOR 



N-LINK = possible N-linked glycosylation sites; ER = endoplasmic 
reticulum; out = secreted; PM = tethered to plasma membrane; CHLOR 
= chloroplast 



As indicated, nucleic acid molecules of the present invention may be in 
the form of RNA, such as mRNA, or in the form of DNA, including, for 
instance, cDN A and genomic DNA obtained by cloning or produced 
synthetically. The DNA may be double-stranded or single-stranded. 
Single-stranded DNA or RNA may be the coding strand, also known as the 
sense strand, or it may be the non-coding strand, also referred to as the 

anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid 
molecule, DNA or RNA, which has been removed from its native environment 
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For example, recombinant DNA molecules contained in a vector are 
considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules 
maintained in heterologous host cells or purified (partially or substantially) 
DNA molecules in solution. Isolated RNA molecules include in vivo or in 
vitro RNA transcripts of the DNA molecules of the present invention. Isolated 
nucleic acid molecules according to the present invention further include such 
molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) with an initiation codon at 
position 64 of the nucleotide sequence shown in Figure 2 [SEQ ID NO: 4]. 
Also included are DNA molecules comprising the coding sequence for the 
mature (3-galactosidase II protein shown at positions 135-2532 of Figure 2 
[SEQ ID NO: 4]. 

In addition, isolated nucleic acid molecules of the invention include 
DNA molecules which comprise a sequence substantially different from those 
described above but which, due to the degeneracy of the genetic code, still 
encode the (3-galactosidase II protein. Of course, the genetic code and species- 
specific codon preferences are well known in the art. Thus, it would be 
routine for one skilled in the art to generate the degenerate variants described 
above, for instance, to optimize codon expression for a particular host (e.g., 
change codons in the plant mRNA to those preferred by a bacterial host such 
as E. coli). Preferably, this nucleic acid molecule will encode the mature 
polypeptide encoded by the above-described deposited cDNA clone. 
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The invention further provides an isolated nucleic acid molecule 
having the nucleotide sequence shown in Figure 2 [SEQ ED NO: 4] or a 
nucleic acid molecule having a sequence complementary to the above 
sequence. Such isolated molecules, particularly DNA molecules, are useful as 
probes for gene mapping, by in situ hybridization with chromosomes, and for 
detecting expression of the (i-galactosidase II gene in plant tissue, for instance, 
by Northern blot analysis. 

The present invention is further directed to nucleic acid molecules 
encoding portions of the nucleotide sequences described herein as well as to 
fragments of the isolated nucleic acid molecules described herein. In 
particular, the invention provides a polynucleotide having a nucleotide 
sequence representing the portion of Figure 2 [SEQ ID NO: 4] which consists 
of positions 1-2538 of Figure 2 [SEQ ID NO: 4]. 

In addition, the invention provides additional nucleic acid molecules 
having nucleotide sequences related to extensive portions of Figure 2 [SEQ 
ID NO: 4] which have been determined from the following related cDNA 
clones: TBG1-3 and TBG5-7 as shown in Figure 3, SEQ. NO's 1-3 and 5-7 

In another aspect, the invention provides an isolated nucleic acid 
molecule comprising a polynucleotide which hybridizes under stringent 
hybridization conditions to a portion of the polynucleotide in a nucleic acid 
molecule of the invention described above, for instance, the cDNA clone 
shown in Figure 2 [SEQ ID NO: 4]. By "stringent hybridization conditions 1 ' is 
intended overnight incubation at 42° C in a solution comprising: 50% 
formamide, 5x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 pg/ml 
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denatured, sheared salmon sperm DNA, followed by washing the filters in 
O.lx SSC at about 65° C. 

As indicated, nucleic acid molecules of the present invention which 
encode a fj-galactosidase II polypeptide may include, but are not limited to 
those encoding the amino acid sequence of the mature polypeptide, by itself; 
and the coding sequence for the mature polypeptide and additional sequences, 
such as those encoding the about 1-23 amino acid leader sequence, such as a 
pre-, or pro- or prepro- protein sequence; the coding sequence of the mature 
polypeptide, with or without the aforementioned additional coding sequences. 

Also discovered is an enhancer/promoter associated with expression of 
the genes encoding p-galactosidase. The inventors have characterized the 
expression profile of TBG2 mRNA and have cloned a lambda genomic cDNA. 
TBG2 is expressed before the onset of fruit ripening and continues at uniform 
level throught all the ripening stages. TBG2 has been found to be expressed in 
all fruit tissues and has also been found to be fruit specific. Experiments have 
shown TBG2 to be unaffected by ethylene. TBG2 is expressed in the ripening 
mutants rin, nor and Nr at the normal chronological time after anthesis. The 
promoter discovered would be useful to express any gene in the sense or 
antisense orientation, specifically in tomato fruit, in all tomato fruit tissues, 
starting before and continuing throughout the entire ripening process. The 
promoter could also be used to express any gene in the ripening mutants rin, 
nor and Nr without the need to gas the fruit with exogenous ethylene. 
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Variant and Mutant Polynucleotides 

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or 
derivatives of the P-galactosidase II protein. Variants may occur naturally, 
such as a natural allelic variant. By an "allelic variant" is intended one of 
several alternate forms of a gene occupying a given locus on a chromosome of 
an organism. Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985). 
Non-naturally occurring variants may be produced using art-known 
mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions. The substitutions, deletions or additions may involve 
one or more nucleotides. The variants may be altered in coding regions, 
non-coding regions, or both. Alterations in the coding regions may produce 
conservative or non-conservative amino acid substitutions, deletions or 
additions. Especially preferred among these are silent substitutions, additions 
and deletions, which do not alter the properties and activities of the (3- 
galactosidase II protein or portions thereof. Also especially preferred in this 
regard are conservative substitutions. 

Most highly preferred are nucleic acid molecules encoding the mature 
protein having the amino acid sequence shown in Figure 2 as pZBG2-l-4 or 
the mature {3-galactosidase II amino acid sequence encoded by the deposited 
cDNA clone. 

Further embodiments include an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 90% 
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identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical 
to a polynucleotide selected from the group consisting of: (a) a nucleotide 
sequence encoding the (5-galactosidase II polypeptide having the complete 
amino acid sequence in Figure 2 [SEQ ID NO: 4] (b) a nucleotide sequence 
encoding the mature p-galactosidase II polypeptide shown in Figure 2 [SEQ 
ID NO: 4]; (c) a nucleotide sequence complementary to any of the nucleotide 
sequences in (a) or (b) above. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of {3- 
galactosidase II polypeptides or fragments thereof by recombinant techniques. 
The vector may be, for example, a phage, plasmid, viral or retroviral vector. 
Retroviral vectors may be replication competent or replication defective. In 
the latter case, viral propagation generally will occur only in complementing 
host cells. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in 
a precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA 
and tac promoters, the SV40 early and late promoters and promoters of 
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retroviral LTRs, to name a few. Other suitable promoters will be known to the 
skilled artisan. The expression constructs will further contain sites for 
transcription initiation, termination and, in the transcribed region, a ribosome 
binding site for translation. The coding portion of the transcripts expressed by 
the constructs will preferably include a translation initiating codon at the 
beginning and a termination codon (UAA, UGA or UAG) appropriately 
positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or 
neomycin resistance for eukaryotic cell culture and tetracycline, kanamycin or 
ampicillin resistance genes for culturing in E. coli and other bacteria. 
Representative examples of appropriate hosts include, but are not limited to, 
bacterial cells, such as E. coli, StrepZBG2-l-4yces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells; insect cells such as 
Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, 293 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 
and pQE-9, available from QIAGEN, Inc., supra; pBS vectors, Phagescript 
vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available 
from Stratagene; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRTTS 
available from Pharmacia. Among preferred eukaryotic vectors are 
pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and 
pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable 
vectors will be readily apparent to the skilled artisan. 
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Introduction of the construct into the host cell can be effected by 
calcium phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals, 
such as Davis et al., Basic Methods In Molecular Biology (1986). 

Example 

Tomato (Lycopersicon esculentum Mill., cv. 'Rutgers') plants were grown 
in a greenhouse using standard cultural practices. The ripening mutants, 
ripening inhibitor (rin), non-ripening (nor) and never ripe (Nr) (Tigchelaar et 
al, 1978), were all in the 'Rutgers' background. Flowers were tagged at 
anthesis and fruit were harvested according to the number of days post- 
anthesis (dpa) or based on their surface color using ripeness stages as 
previously described (Mitcham et al, 1989), the complete disclosure of which 
is hereby fully incorporated herein by reference. For gene expression studies, 
a variety of leaf, flower, and stem tissues were harvested from greenhouse- 
grown plants and roots were harvested from seedlings grown in basal tissue 
culture medium for 4 weeks after seed germination. 

RNA Extraction 

Fruits were processed immediately after harvest in the greenhouse by 
chilling on ice, excising the various tissues and freezing them in liquid 
nitrogen. Tissue samples were ground using a mortar and pestle and stored at - 
80°C. RNA was extracted using the method described in Verwoerd et al. 
(1989). Poly(A)RNA was purified from total RNA using oligo(dT) columns 
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(Pharmacia, Piscataway, NJ). RNA was quantified by measuring A 2 6o using a 
dual beam spectrophotometer. 



RT-PCR 

5 Degenerate primers were designed based on the highest shared deduced 

amino acid sequence identity we found between an apple (accession number 
P48980), asparagus (P45582) and carnation (Q00662) p-galactosidase cDNA 
clones. The two primers used for the first reaction were BGS'El 
(WSNGGNWSNATHCAYTAYCC) and BG3E 

10 (CCRT A YTCRTCN ADNGGNGG) . A second reaction was done on the 

products of the first reaction using BG5'U 

(ATHCARACNTAYGTNTTYTGG) and BG3E. The degeneracy code for the 
primer sequences is N=a-H+c+g; H-a+t+c; B=t+c+g; D=a+t+g; V=a+c+g; 
R=a+g; Y=c+t; M=a+c; K=t+g; S=c+g; and W=a+t. The 5' and 3' primers 

15 corresponded to amino acids 72-78 and 321-315 of the apple clone, 

respectively. Amplification was done using AmpliTaq DNA polymerase 
(Perkin Elmer, Norwalk, CT) and standard PCR conditions using the cDNA 
made for the first cDNA library described below as a template (Ausubel et ah, 
1987). PCR products were separated in an agarose gel and fragments of the 

20 expected size (approximately 750 bp) were purified, cloned into pCRscript 

(Stratagene, La Jolla, CA) , and sequenced. 
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cDNA library 

Two cDNA libraries were constructed. The first comprised poly(A) RNA 
isolated from breaker, turning and pink fruit pericarp from 'Rutgers' plants. 
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The cDNA synthesis and library construction was done exactly according to 
the manufacturers instructions for the ZAP-cDNA Gigapack II Gold Cloning 
Kit (Stratagene), the complete disclosure of which is fully incorporated herein 
by reference. First-strand cDNA synthesis was primed using a poly(dT) 
primer and inserts were directionally cloned into the Uni-Zap XR vector using 
EcoRI and Xhol restriction sites. The second library comprised poly(A) RNA 
isolated from all fruit tissues (except seeds) from immature green, mature 
green, breaker, turning, pink, red-ripe and over-ripe fruit of 'Rutgers' plants. 
The cDNA synthesis and library construction was done exactly according to 
the manufacturers instructions for the Superscript Lambda System for cDNA 
synthesis and • Cloning (GibcoBRL, Gaithersburg, MD). First-strand cDNA 
synthesis was primed using a oligo(dT) primer and cDNA inserts were 
directionally cloned into the • ZipLox cloning vector using Sail and NotI 
restriction sites. Both libraries were amplified and maintained using the host 
strains provided by the manufacturer, according to their instructions. 

One of the clones (RT-PCR2-1) was used to screen 10 6 plaques from the 
tomato fruit cDNA libraries at low stringency (hybridization at 45°C, no 
formamide and final wash with 0.2X SSC at 42°C). Thirty positive cDNA 
clones were identified and partially sequenced. Complete sequencing and 
characterization of the RT-PCR and cDNA clones revealed the possibility of 
seven unique [}-galactosidase genes. 
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DNA and RNA Gel Blot Analysis 

Southern analysis was done using the 3' UTR of each full length clone 
and the RT-PCR clones as probes against restriction enzyme digested genomic 
DNA. DNA gel blot analysis was done essentially as described in Smith and 

5 Fedoroff (1995) except that 3 jxg of genomic DNA was used for each digest. 

The genes corresponding to the clones appeared to be present as single copies 
(data not shown). The same probes were used to map 6 of the 7 genes using 
RFLPs of recombinant inbred lines and the loci names and map positions are 
shown in Table II (James Gioviannone, Texas A&M University, personal 

10 communication). 



Table IL TBG loci map positions. Genes were maped by Southern 
analysis using RFLPs of recombinant inbred lines. 



Gene 


chromosome 


map position 


TBG1 


12* 


overlap of IL 12-2, IL 12-3 


TBG2 


9 


IL9-3 


TBG3 


3 


IL3-5 


TBG4 


12* 


overlap of IL 12-2, IL 12-3 


TBG5 


11 


IL 11-3 


TBG6 


2 


overlap of IL 2-4, IL 2-5 


TBG7 


no RFLP 





*TBG1 and 4 are loosely linked 



15 



Total RNA (20 \ig/ lane) was separated in a formaldehyde/Mops 
agarose gel, transferred to Hybond-N + nylon membrane (Amersham, Arlington 
Heights, EL), fixed by incubating for 2 h at 80°C, hybridized overnight in a 
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hybridization incubator (Robbins Scientific, Sunnyvale, CA) using a buffer 
described by Church and Gilbert (1984) washed to a final stringency of 0.1 X 
SSC with 0.2% SDS at 65°C, and autoradiographed essentially as described by 
Ausubel et al (1987). An RNA ladder standard (GibcoBRL) was used to 

5 estimate the length of the RNAs. Probes were synthesized using a random 

priming kit with 32 P-dATP as the label (Boehringer Mannheim, Indianapolis, 
IN). Northern analysis was done using the 3' UTR of each full length clone 
and the RT-PCR clones as templates for probe synthesis. As a loading control, 
RNA blots were stripped and re-probed at a reduced hybridization and 

10 washing stringency using a soybean 26S rDNA fragment (Turano et al., 1997). 

For all hybridizations, 32 P(dATP)-labeled probe was diluted to 1-2 x 10 6 
dpm/mL. The complete disclosures of the above references are fully 
incorporated herein by reference. 



15 Sequence Analysis 

Sequencing was done at the Iowa State University Sequencing Facility 
(Ames, IA) using a PCR-based dideoxynucleotide terminator protocol and an 
ABI automated sequencer (Applied Biosystems, Foster City, CA). The 
sequencing of both cDNA insert strands was done by primer walking. 
20 Nucleotide and deduced amino acid sequence comparisons against the 

databases were done using BLAST searches (Altschul et al, 1990). Sequence 
data were analyzed and aligned using DNA Strider 1.2 (Marck, 1988) and 
MacDNAsis (Hitachi, San Bruno, CA) software. The complete disclosures of 
the above references are fully incorporated herein by reference. 
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Northern Blot Analysis 
Tissue Specific Expression 

Northern blot analysis was done to reveal which, if any, of the (3- 
galactosidase genes had a fruit-specific expression pattern. With the exception 
of TBG2, transcripts of all clones were detected in non-fruit tissues (Fig. 4). 
Transcripts of TBG 1, 4, 5 and 6 were detected in all the tissues tested. TBG3 
transcript was detected at low levels in root and stem tissues, while TBG7 
transcript was detected in flower and stem tissues. 



Temporal expression pattern in fruit 

The temporal expression pattern of the seven genes in fruit tissue was 
examined using RNA extracted from all fruit tissues except seeds. Transcripts 
for all seven genes were detected during some stage of fruit development (Fig. 
5). TBG1 and 3 had similar expression patterns and their transcripts were 
detected throughout the breaker to over-ripe stages. TBG2 had a unique 
expression pattern and its transcript was detected at a constant level from 30 
dpp to the over ripe stage. TBG4 expression pattern was similar to TBG1 and 
3, but differed in that the transcript level was significantly higher at the turning 
stage. TBG5 had a similar expression pattern to TBG4 during the ripening 
stages of development, however TBG5 transcript was also detected throughout 
all the earlier stages of fruit development. TBG6 had an interesting expression 
pattern and its transcript was only detected at high levels in all pre-ripening 
stages tested. TBG7 also had a unique expression pattern and its transcript was 
detected at very low levels throughout all the stages tested, and at moderate 
levels at 10 dpp and the over-ripe stage. 
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Spatial expression pattern in fruit 

Northern blot analysis was also done to determine transcript 
accumulation in various fruit tissues. Since there were temporal differences in 
the expression patterns of the TBG genes both the mature green and turning 
fruit stages were used for RNA extractions (Fig. 6). Both TBG2 and TBG6 
transcripts were detected in all mature green fruit tissues tested. TBG7 
transcript was present in all fruit tissues tested except for locules. Both TBG1 
and TBG4 transcripts were detected in RNA samples extracted from all 
turning stage fruit tissues. TBG4 transcript was notably more abundant in the 
peel. TBG3 and TBG5 expression patterns were unique and their transcripts 
were detected in all tissues except the outer pericarp and locular respectively. 

Expression in normal versus mutant fruit 

In order to better understand the potential roles of the TBG products 
and transcriptional regulatory mechanisms, northern analysis was performed 
using fruit tissue from the ripening mutants rin, nor and Tf. This analysis was 
important because it might give clues for preliminary determination of any 
potential ripening and/or softening role any of the TBGs might possess. 

The results of mutant fruit Northern analysis suggested that the 
transcriptional regulation of TBG1, 2, 3, 5 and 7 was unaffected in mutant fruit 
tissue and that their transcripts were present in a normal chronological (dpp) 
pattern (Fig. 7). The abundance of TBG4 and 6 transcripts were however 
different in the mutant fruit. TBG4 transcript was not detected in fruit tissue of 
bf and was detected at much lower levels in rin and nor than wild type fruit 
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tissues. Normally TBG6 transcripts are detectable at high levels throughout the 
early stages of fruit development but are not detectable after the mature green 
stage (40-42 dpp). TBG6 transcripts persisted even to 50 dpp in fruit of all 
three mutants. 

Transcriptional regulation by ethylene 

The northern analysis done using mutant and wild type fruit suggested 
that TBG4 expression might be up-regulated by ethylene and that TBG6 
expression might be down-regulated by ethylene. In order to evaluate this 
hypothesis mature green fruit were harvested and subjected to a continuous 
flow of 10 ppm ethylene mixed in air. Control and ethylene-treated fruit were 
used for RNA extractions at 1 , 2, 12 and 24 hours. The results of this 
experiment confirmed the findings from the mutant fruit northern analysis. As 
expected, the presence and abundance of TBG1, 2, 3, 5 and 7 transcripts was 
essentially unaffected in mature green tissues subjected to exogenous ethylene 
treatment (Fig. 8). However, TBG4 transcript abundance was increased in 
mature green tissues in the presence of ethylene. From the data presented it 
was unclear whether TBG6 transcript abundance was reduced by exogenous 
ethylene treatment since its transcript level was normally reduced at this stage 
of fruit development. 

Enzyme activity 

In order to determine the role of the TBG encoded products we 
initiated experiments to express the cDNA encoded enzymes using 
heterologous expression systems. Several E. coli expression systems were 
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tested, but the yield of product was very low due to toxicity ( See the example 
below). Therefore we used a yeast expression system which secretes a mature 
amino-terminal-FLAG fusion protein into the culture medium. The TBG4 
cDNA was tested first and resulted in the production of approximately 1 mg 
TBG4 active protein per 50 mis culture. TBG4 was used first because the 
cDNA codes for the enzyme P-galactosidase II which was purified from 
tomato fruit and has been characterized in some detail (Carey et al 1995, Smith 
et al 1998). Therefore we could compare the activity of the heterologous 
system-expressed protein to the native enzyme purified from tomato. The 
TBG4 protein was successfully affinity purified using an anti-FLAG affinity 
resin (Figure 9). 

The affinity-purified TBG4 enzyme was shown to have (3(1^4)-D- 
galactosidase activity by virtue of its ability to hydrolyze the synthetic 
substrate p-nitrophenyl-(3-D-galactopyranoside (Smith et al. 1998). The 
enzyme can cleave galactosyl residues from a variety of cell wall substrates 
and therefore has exo-galactanase activity (Table IE). The remaining full- 
length cDNA clones are currently being tested for successful expression of 
active enzyme. Preliminary results have shown that TBG1 codes for an 
enzyme which also has both p-D-galactosidase and exo-galactanase activity 
(Table HI). 
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Table III. Cell wall degrading activity of TBG4 and TBG1 expressed 


in yeast. Removal of galactosyl residues from chelator soluble (CSP) 


and alkali soluble (ASP) pectin and hemiceilulosic (HCF) cell wall 


fractions purified from tomato fruit. 






fig galactose 




released 


enzyme substrate 


boiled live 


a TBG4 CSP 


0 5 


ASP 


0 14.5 


HCF 


0 4 


b TBG1 ASP 


0 1.2 


2 mg substrate; 4 hours at 37°C 




a .005 units enzyme/rx 




b .0005 units enzyme/rx 




pZBG2-l-4 Codes for a (3-Galactosidase 




The TBG4 ORF was cloned in-frame into the repressible/inducible 



bacterial expression vector pFLAG-CTC. The host strain XLl-Blue MR is a 



mutant strain containing no endogenous p-galactosidase activity nor a- 
complementation. Induction of gene transcription by (IPTG) caused the 
immediate cessation of E. coli growth at 30 to 37°C. However, induction at 
20°C did allow for some limited E. coli growth. When clones containing the 
pZBG2-l-4 4 ORF were grown at 20°C and induced with IPTG, the cells 
slowly turned blue after 36 hrs growth in medium containing the (3- 
galactosidase substrate X-GAL, (Figure 10). If not induced with IPTG, no 
blue color was seen, even after extended growth in media containing X-GAL. 
As an additional negative control, clones consisting of XLl-Blue MR 
transformed with the FLAG vector alone never showed any p-galactosidase 
activity with or without IPTG- induction, even after 7 -days growth (Fig 10). 
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As a positive control for maximal (3-galactosidase (derived from E. coli P- 
galactosidase) activity the cloning vector pGEM was transformed into the host 
strain DH5a and the results are also shown in Figure 10. Figure 10 shows the 
detection of J3-galactosidase activity from pZBG2-l-4 expression in E. coli. 
Cells were harvested and extracts were prepared every 12 hours and the A615 
measured. Cultures were grown with the addition of the chromogenic substrate 
X-GAL (open symbols) or X-GAL and the transcriptional inducer IPTG 
(closed symbols) in the medium. The vector used as a positive control for E. 
coli ^-galactosidase activity was pGEM (■) and the vector used as a negative 
control and for expression was pFLAG-CTC either without (om) or containing 
the pZBG2-l-4 ORF (a ,*). 

Effects on Plant Tissue Texture 

To further demonstrate the function of TBG4 encoded P-galactosidase II 
the following experiments were carried out. 

Fruit from tomato plants containing antisense constructs to suppress 
TBG4 mRNA were up to 40% firmer [compare means of parental line #1 with 
antisense line #2 in Figures 1 1 A - 1 lE(l-4)] than fruit from the parental line. 
Among the transformants the line with the firmest fruit also had the lowest 
overall levels of TBG4 mRNA (Figure 12A,B)- This correlation suggests that a 
reduction in TBG4 mRNA is associated with increased fruit firmness. Firmer 
fruit might result in (1) less shipping damage (a) less loss due to damage and 
(b) ability to harvest at later stage resulting in better flavor at market (2) longer 
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shelf life for both market and consumer. (3) better quality fruit for fresh slice 
market; fruit cut better at the pink/red stage when firmer. 



Methods 

To determine the function of TBG4 encoded p-galactosidase II, antisense 
constructs were made using the constitutively expressed 35S CaMV promoter 
to express TBG4 antisense RNA (Figure 13). Constructs were moved into 
tomato using Agrobacterium-mediated transformation. Four tomato cultivars 
have been transformed in order to evaluate the effect of TBG4 suppression on 
processing tomato (cv 'UC82b') fruit paste quality and three fresh pick 
cultivars. Of the fresh pick cultivars one is a soft fruit large cherry tomato (cv 
' Ailsa Craig'), the second is a soft fruit old breeding line (cv 'Rutgers') and 
the third is a recently developed somewhat firm cultivar 'New Rutgers'. 
Among the lines where TBG4 mRNA is suppressed we expect to observe an 
increase in firmness and paste viscosity. 

Texture 

Although this project is nearly finished the complete biochemical and 
molecular analysis is not finished. The preliminary results on the analysis of 
the 'New Rutgers' cultivar is presented in Figures 1 1 A - E(l-4) and 12A,B. In 
this example a fresh pick cultivar called 'New Rutgers' was used. Plants of the 
purchased seed were grown and allowed to self and the resulting seed was 
used as the parental control (line 1). Seven independent transformed plants 
(lines 2-8) containing TBG4 antisense constructs were grown and allowed to 
self. Transformation (T-DNA insertion) was confirmed by southern analysis 
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(data not shown). From each transformed line, five plants were grown along 
with 10 parental line plants. Fruit were tagged at the breaker stage (1 st onset of 
color change) and were harvested at breaker plus 7 days. Data were taken 
using 15-20 fruit from each line. Each type of texture measurement was done 
twice for each fruit and fruit were subjected to 4 types of texture 
measurements using a Stable Micro System's TA-XT2i texture analyzer. The 4 
measurements were; 1, 2-inch flat plate compression to 3 mm (Figure 1A), 2, 4 
mm spherical indenter compression to 3 mm (Figure 1 IB), 3, 4 mm cylindrical 
indenter compression to 3 mm (Figure 1 1C) and 4, 4 mm cylindrical indenter 
puncture to 10 mm (Figure 1 ID). The summary of this data is shown in 
Figure 1 lE(l-4). In Figures 1 1 A -E (1-4) line 1 was the parental line and lines 
2-8 each represent an independent transformant containing one T-DNA copy 
of the TBG4 antisense construct. Statistical analysis (Duncans and Scheffe) of 
the data revealed that fruit from the transformed lines 3, 7 and 8 were not 
significantly different from the parental line but that transformed lines 2, 4, 5 
and 6 were significantly firmer than the parental fruit. Most noteworthy is that 
fruit from transformed line 2 had fruit with a mean firmness that was 40% 
firmer than that of the parental line (Figures 1 1 A-D). 

Northern Blot Analysis 

We are currently investigating any changes in the biochemical 
composition of fruit where TBG4 mRNA levels have been suppressed. These 
experiments are designed to show a link between increased fruit firmness and 
TBG4 mRNA suppression, TBG4 encoded enzyme activity suppression, 
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possible cell wall modification (e.g. increased galactosyl residue content) and a 
decrease in free galactose levels during fruit ripening. 

These experiments are not complete, however some preliminary 
Northern blot experiments were done and the data is shown in Figure 12A,B- 
There is no parental or azygous control fruit RNA shown in Figure 12A,B 
because these plants were the last to grow and RNA extractions are just being 
done now. As a comparison of normal fruit TBG4 mRNA levels refer to 
Figure 5 above. The data from Figure 5 showed that TBG4 mRNA levels are 
low at the mature green stage, peak at the turning stage and are reduced at the 
red stage. All the lines except for 2 and 3 expressed antisense TBG4 mRNA 
(Figure 12A,B). The antisense transcripts appear as two bands, smaller in 
length than the endogenous mRNA. The two bands probably resulted from 1 , 
the expected transcriptional stop signal provided by the NOS -terminator and 2, 
a cryptic transcriptional stop signal in the antisense TBG4 cDNA. The most 
notable result was in line 2 where no TBG4 mRNA was detected at the turning 
stage. Line 2 also had the firmest red fruit (see Figure 1 1 A -D). The absence of 
detectable TBG4 mRNA probably was the result of cosupression of both the 
endogenous and antisense mRNAs. When compared to earlier blots (e.g. 
Figure 4), all of the lines appeared to have an overall reduced level of TBG4 
mRNA, but it is impossible to assign numbers to this statement without the 
parental and azygous control RNA on the same Northern blot. 

The specification discloses that P-galactosidase II polypeptide is involved 
in the degradation of cell wall pectin during fruit ripening. In the present 
invention, the role of (3-galactosidases in tomato during fruit ripening and 
softening and the description of the cloning of a p-galactosidase cDNA clone 
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that codes for a p(l-»4)galactan degrading enzyme, which is expressed in 
ripening tomato fruit tissues, has been shown. 

The present work indicates that pZBG2-l-4 is a cDNA derived from the 
transcript of the TBG4 gene which codes for (3-galactosidase II for the 
following reasons: 

First, the deduced amino acid sequence of the highly conserved amino- 
terminal portion of the expected mature pZBG2-l-4 translation product 
matches almost exactly (28 of 30 amino acids) with the amino-terminal 
sequence of (3~galactosidase II as purified by Carey et al. (1995) and 
designated TOMAA. Importantly, the two amino acids (KY) in the (i- 
galactosidase II sequence (TOMAA), that do not match the pZBG2-l-4 
deduced amino acid sequence of the present invention are believed to be 
incorrect since all plant (3-galactosidase sequences in the database and four 
additional (3-galactosidase-related cDNAs that were identified from tomato all 
match or have conserved substitutions with the deduced amino acid sequence 
of pZBG2-l -4 at these same two amino acid (ST) positions (Figure 3). 

Second, the transcript detected by pZBG2-l -4 is present in normal 
ripening fruit at the same time that (3-galactosidase II activity was detected 
(Figure 5; Carey et aL, 1995). Moreover, little or no transcript was detected 
in fruit at 45 and 50 dpa from the mutants nor, rin and Nr (Figure 7). This 
observation also coincides with the data presented by Carey et aL (1995) that 
P-galactosidase II activity remained at levels equal to mature green fruit and 
did not rise in fruit 45-65 dpa from nor or rin plants. Interestingly, Carrington 
and Pressey (1996) have .reported that (3-galactosidase II activity was only 
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detected in 'Rutgers' fruit after the turning stage of ripeness. The Northern 
data in the present invention indicates that maximum (5-galactosidase II 
activity occurs only after the turning stage, assuming mRNA levels predict 
extractable enzyme activity (Figure 5). 

Third, the apparent molecular weight of 77.9 kD and pi of 8.9 for the 
mature protein predicted from the pZBG2-l-4 sequence is similar to that 
determined for (3-galactosidase H, Pressey (1983), estimated a molecular 
weight of 62 kD by gel-filtration column chromatography and a pi of 7.8 by 
isoelectric focusing, while Carey et al (1995) estimated a molecular weight of 
75 kD by SDS-PAGE and a pi of 9.8 by isoelectric focusing. 

Fourth, enzyme produced from pZBG2-l-4 ORF using a heterologous 
yeast expression system has both [3-galactosidase activity and exogalactinase 
activity. 
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What we claim is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide 
having a nucleotide sequence at least 95% identical to a sequence selected 
from the group consisting of: 

(a) a nucleotide sequence encoding the (i-galactosidase II polypeptide 
having the complete amino acid sequence selected from the group consisting 
of SEQ ID NO:8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID 
NO:12, SEQK>NO:13, SEQIDNO:14, SEQ ID NO: 15 and SEQ ID NO: 16 
and designated TBG1, TBG2, TBG3, TBG4, TBG5, TBG6 and TBG7, 
respectively as shown in Figure 2 or as encoded by the cDNA clone selected 
from the group consisting of cDNA clones contained in Gen Bank Accession 
No. AF023847, AF154420, AF 154421, AF020390, AF154423, AF154424 and 
AF1 54422; 

(b) a nucleotide sequence encoding the mature (3-galactosidase II 
polypeptide having the amino acid sequence at about positions 24-724 selected 
from the group consisting of SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15 and SEQ ID NO: 16 and designated TBG1, TBG2, TBG3, TBG4, 
TBG5, TBG6 and TBG7, respectively as shown in Figure 2 or as encoded by 
the cDNA clone selected from the group consisting of cDNA clones contained 
in Gen Bank Accession No. AF023847, AF154420, AF154421, AF020390, 
AF154423, AF154424 and AF154422; and 

(c) a nucleotide sequence complementary to any of the nucleotide 
sequences in (a) or (b), above. 

2. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 as shown in Figure 2. 
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3. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the nucleotide sequence in Figure 2 (SEQ ID NO:4) 
encoding the (3-galactosidase II polypeptide having the amino acid sequence 
designated TBG4 in Figure 2. 

4. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the nucleotide sequence in Figure 2 (SEQ ID NO:4) 
encoding the mature polypeptide having the amino acid sequence from about 
24 to about 724 in the amino acid sequence designated TBG4 in Figure 2. 

5. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in Gen Bank Accession No. AF023847. 

6. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in Gen Bank Accession No. AF1 54420. 

7. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in Gen Bank Accession No. AF154421. 

8. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in Gen Bank Accession No. AF020390. 

9. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in Gen Bank Accession No. AF 154423. 
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10. The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in Gen Bank Accession No. AF1 54424. 

5 

1 1 . The nucleic acid molecule of claim 1 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in Gen Bank Accession No. API 54422. 

10 12. An isolated nucleic acid molecule comprising a polynucleotide 

which hybridizes under stringent hybridization conditions to a polynucleotide 
having a nucleotide sequence identical to a nucleotide sequence in (a), (b), or 
(c) of claim 1 wherein said polynucleotide which hybridizes does not hybridize 
under stringent hybridization conditions to a polynucleotide having a 

15 nucleotide sequence consisting of only A residues or of only T residues. 

13. An isolated nucleic acid molecule comprising a polynucleotide 
which encodes the amino acid sequence of an epi tope-bearing portion of a (3- 
galactosidase II polypeptide having an amino acid sequence in (a), (b), or (c) 

20 of claim 1. 

14. A method for making a recombinant vector comprising 
inserting an isolated nucleic acid molecule of claim 1 into a vector. 

25 15. A recombinant vector produced by the method of claim 14. 

16. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 15 into a host cell. 



30 



17. 



A recombinant host cell produced by the method of claim 1 6. 
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18. A recombinant method for producing p-galactosidase II 
polypeptide, comprising culturing the recombinant host cell of claim 17 under 
conditions such that said polypeptide is expressed and recovering said 
polypeptide. 

19. An isolated p-galactosidase II polypeptide comprising an amino 
acid sequence at least 95% identical to a sequence selected from the group 
consisting of: 

a) amino acid sequence at about positions 24-724 selected from the 
group consisting of sequences SEQ ID NO: 8, SEQ ED NO: 9, SEQ ID NO: 
10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ 
ID NO: 15 and SEQ ID NO: 16 and designated TBG1, TBG2, TBG3, TBG4, 
TBG5, TBG6 and TBG7, respectively as shown in Figure 2; and 

b) amino acid sequence as encoded by the cDNA clone selected from 
the group consisting of cDNA clones contained in Gen Bank Accession No. 
AF023847, AF154420, AF154421, AF020390, AF154423, AF 154424 and 
AF1 54422. 

20. An isolated polypeptide comprising an epitope-bearing portion 
of the (i-galactosidase II protein. 

21 . An isolated antibody that binds specifically to a [3-galactosidase 
II polypeptide of claim 20. 
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22. An isolated nucleic acid molecule nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding the (3-galactosidase EI polypeptide 
having the complete amino acid sequence selected from the group consisting 
of SEQ ID NO:8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15 and SEQ ID NO: 16 
and designated TBG1, TBG2, TBG3, TBG4, TBG5, TBG6 and TBG7, 
respectively as shown in Figure 3 or as encoded by the cDNA clone selected 
from the group consisting of cDNA clones contained in Gen Bank Accession 
No. AF023847, AF154420, AF1 54421, AF020390, AF154423, AF154424 and 
AF1 54422; 

(b) a nucleotide sequence encoding the mature p-galactosidase II 
polypeptide having the amino acid sequence at about positions 24-724 selected 
from the group consisting of sequences SEQ ID NO: 8, SEQ ID NO: 9, SEQ 
ED NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 
14, SEQ ID NO: 15 and SEQ ID NO: 16 and designated TBG1, TBG2, TBG3, 
TBG4, TBG5, TBG6 and TBG7, respectively as shown in Figure 3 or as 
encoded by the cDNA clone selected from the group consisting of cDNA 
clones contained in Gen Bank Accession No. AF023847, API 54420, 

AF 154421, AF020390, AF154423, AF154424 and AF154422; and 

(c) a nucleotide sequence complementary to any of the nucleotide 
sequences in (a) or (b), above. 

23. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has a complete nucleotide sequence in Figure 2 selected from 
the group consisting of SEQ ID NOs: 1-3 and 5-7. 
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24. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has a nucleotide sequence in Figure 2 selected from the group 
consisting of SEQ ID NOs: 1-3 and 5-7 encoding the P-galactosidase 
polypeptide having the complete amino acid sequence designated TBG1-3 and 
5-7, respectively. 

25. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has the nucleotide sequence in Figure 2 selected from the 
group consisting of SEQ ID NOs: 1-3 and 5-7encoding the mature polypeptide 
having the amino acid sequence designated TBG1-3 and 5-7, respectively. 

26. The nucleic acid molecule of claim 22 wherein said 
polynucleotide has the complete nucleotide sequence of the cDNA clone 
contained in an Gen Bank Accession No. selected from the group consisting of 
ATCC Deposit No. selected from the group consisting of AF023847, 
AF154420, AF154421, AF020390, AF154423, AF154424 and AF154422. 

27. A method of modifying cell wall metabolism in a plant which 
comprises transforming said plant with a DNA construct adapted to modify the 
activity of a P-galactosidase, growing said plant or its descendent and selecting 
a plant having modified cell wall characteristics, said construct comprising a 
transcriptional initiation region operative in plants operably linked to a DNA 
sequence encoding at least one P-galactosidase. 

28. A method as claimed in claim 27, wherein said DNA sequence 
is selected from the group consisting of the sequences of nucleic acid 
molecules claimed in claim 1 or claim 22. 

29. A plant cell transformed with a nucleic acid molecule as 
claimed in claim 1 or claim 22. 



30. 



A plant derived from a plant cell as claimed in claim 29. 
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32. A method for modifying (i-galactosidase gene expression in a 
plant comprising transforming said plant with a nucleic acid molecule as 
5 claimed in claim 1 or claim 22, growing the transformed plant and selecting a 

plant having modified (3-galactosidase gene expression when compared with 
an untransformed plant. 
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A plant seed derived from a plant as claimed in claim 30. 
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Figure 1. p-Galactosidase phylogenetic tree based on shared amino acid 
sequence identity. A. Tomato p-galactosidase (TBG) cDNAs. B. Plant 
p-galactosidases. Higgins-Sharp algorithm (UPGMA method) 
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Figure 2 

Sh et1 of 12 w 

0. M /clone na«t TBGl/pZBOa-1-10, acce—ion nu»b« AF023847, Saguence XD nuaber 1 

i m . 1TO .V!W 30 

306 ATS GGT TTT TOG ATG GCA ATG TTG CTG ATG TTG TTA TTG TGT TTA TOG GTT TCT TGT GGA ATT OCT TOT 374 
1 Met Gly Phe Trp Met Ala Met Leu Leu Met Leu Leu Leu Cys Leu Trp Val Ser Cys Gly He Ala Ser 23 

375 GTT TCA TAT GAC CAT AAA GOT ATC ATT GTA AAT GGA CAA AGA AAA ATT CTC ATT TCT ^ CC A ^" r 

24 Val Ser Tyr Asp His Lys Ala He He Val Asn Gly Gin Arg Lye He Leu He Ser Gly Ser He His 

AGC ACC CCT GAG ATG TGG CCA GAT CTT ATT CAG AAG GCA AAA GAA GOG GGA GTT GAT GTT 512 



^47 Tyr Pro Arg Ser Thr So Glu Met Trp Pro Asp Leu He Gin Lys Ala Lys Glu Gly Gly Val Asp Val 

513 ATA CAG ACT TAT GTT TTC TGG AAT GGG CAT GAG CCT GAA GAA GOG AAA TAT TAT TTT GAA GAG AGG TAT 
70 He Gin Thr Tyr Val Phe Trp Asn Gly His Glu Pro Glu Glu Gly Lys Tyr Tyr Phe Glu Glu Arg Tyr 

582 GAT TTA GTG AAG TTC ATT AAA GTG GTG CAA GAA GGA GGA CTT TAT GTG CAT CTT AGG ATT GGA OCT ^^T 
93 Asp Leu Val Lys Phe He Lys Val Val Gin Glu Ala Gly Leu Tyr Val His Leu Arg He Gly Pro Tyr 

651 GCA TGT GCT GAA TGG AAT TTT GGG GGT TTT CCT GTT TGG CTG AAG TAT GTT CCA GGT ATT AGT TTC 
116 Ala Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro Gly He Ser Phe Arg 

720 ACA AAC AAT GAG CCA TTC AAG GCT GCA ATG CAA AAG TTC ACT ACT AAG ATT GTT GAT ATG ATG AAA ^^A 
139 Thr Asn Asn Glu Pro Phe Lys Ala Ala Met Gin Lys Phe Tar Thr Lys He Val Asp Met Met Lys Ala 

789 GAA AAG CTC TAT GAA ACT CAG GGT GGT CCA ATT ATT CTA TCT CAG ATA GAA AAT GAA TAT GGA CCT ATG 
162 Glu Lys Leu Tyr Glu Thr Gin Gly Gly Pro He He Leu Ser Gin He Glu Asn Glu Tyr Gly Pro Met 

858 GAG TGG GAA CTA GGT GAA CCT GGT AAA GTT TAC TCA GAA TGG GCA GCC AAA ATG GCT GTG GAT CTT GGC ^ 
185 Glu Trp Glu Leu Gly Glu Pro Gly Lys Val Tyr Ser Glu Trp Ala Ala Lys Met Ala Val Asp Leu Gly 

927 ACT GGT GTC CCA TGG ATC ATG TGC AAG CAA GAT GAT GTC CCT GAT CCT ATT ATT AAT ACT TGC AAT GGT 
208 Thr Sy Val Pro Trp He Met Cys Lys Gin Asp Asp Val Pro Asp Pro He He Asn Thr Cys Asn Gly 

996 TTC TAC TGT GAC TAC TTC ACA CCA AAT AAG GCT AAT AAA CCC AAG ATG TGG ACT GAA GCC TGG 

231 Phe Tyr Cys Asp Tyr Phe Thr Pro Asn Lys Ala Asn Lys Pro Lys Met Trp Thr Glu Ala Trp Thr Ala 

1065 TGG TTT ACC GAA TTT GGA GGT CCA GTT CCT TAC QGT CCT GCA GAG GAT ATG ^ val Ala Ara 

254 Trp Phe Thr Glu Phe Gly Gly Pro Val Pro Tyr Arg Pro Ala Glu Asp Met Ala Phe Ala Val Ala Arg 

1134 TTT ATA CAA ACG GGA GGC TCC TTC ATC AAT TAC TAT ATG TAT CAT GGA GGA ACA AAC TTT GGA AGG ACT 
277 Phe He Gin Thr Gly Gly Ser Phe He Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr 

1203 TCT GGT GGC CCA TTT ATT GCT ACT AGT TAT GAT TAT GAT GCA CCC CTA GAT GAA TTT GGG TCA TTA CGG 
300 Ser Gly Gly Pro Phe lie Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Phe Gly Ser Leu Arg 

1272 CAG CCT AAA TGG GGT CAT CTG AAA GAT CTA CAT AGA GCA ATA AAG CTC TGT GAG CCA GCT TTA GTA TCT 
323 Gin Pro Lys Trp Gly His Leu Lys Asp Leu His Arg Ala He Lys Leu Cys Glu Pro Ala Leu Val Ser 

1341 GTA GAT CCA ACT GTG ACA TCC TTA GGA AAC TAT CAA GAG CO CC7T CHT TTC AAG ^ ^ TCT ^ GCC 
346 Val Asp Pro Thr Val Thr Ser Leu Gly Asn Tyr Gin Glu Ala Arg Val Phe Lys Ser Glu Ser Gly Ala 

1410 TGC GCT GCC TTC CTA GCA AAT TAC AAC CAG CAC TCT TTT GCT AAA GTG GCA TTT GGG AAC ATG CAT 
369 Cys Ala Ala Phe Leu Ala Asn Tyr Asn Gin His Ser Phe Ala Lys Val Ala Phe Gly Asn Met His Tyr 

1479 AAC TTG CCA CCC TGG TCT ATC AGC ATT CTT CCC GAC TGC AAG AAC ACT GTC TAT AAT ACT GCA AGG GTT 
392 Asn Leu Pro Pro Trp Ser He Ser He Leu Pro Asp Cys Lys Asn Thr Val Tyr Asn Thr Ala Arg Val 

1548 GGT GCT CAA AGT GCT CAG ATG AAG ATG ACT CCA GTC AGT AGA GGA TTC TCA TGG GAG TCA TTC AAT 
415 Gly Ala Gin Ser Ala Gin Met Lys Met Thr Pro Val Ser Arg Gly Phe Ser Trp Glu Ser Phe Asn. Glu 
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Qan./clona »a».» TBGl/pZBG2^10, a^slon nu-^r AF023847, 8.^V XD nu»!>-r 1 cent. 
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161, GAC GCA GCA TOG CAT GAA GAC GAC ACT TTC ACA GTT GTT QGG TTA TTG GAG CAG ATT AAT ATC ACA AGA 
438 Asp Ala Ala Ser His Glu Asp Asp Thr Phe Thr Val Val Gly Leu Leu Glu Glr> lie Asn lie Tnr Arg 

1686 GAT GTA TCT GAT TAC TTG TGG TAT ATG ACT GAC ATT GAG ATT GAT CCA ACA GAA GGA TTT TTG 2iKT 
«! Asp vll leISp Tyr Leu Trp Tyr Met Thr Asp lie Glu He Asp Pro Thr Glu Gly Phe Leu Asn Ser 

1755 GGA AAT 1X3G CCT TGG CTT ACT GTC TTT TCT GCT GGC CAT GCA TTG CAT GTA TTC GTG AAT TTA 
«4 Gly Asn Trp Pro Trp L^u Thr Val Phe Ser Ala Gly His Ala Leu His Val Phe Val Asn Gly Gin Leu 

1824 QCA GGA ACT GTG TAC GGA AGT TTA GAA AAC CCA AAA CTA ACT TTC AGC AAC GGT ATA AAT CTG AGA 
111 Ala Gly Thr Val Tyr Gly Ser Ueu Glu Asn Pro Lys Leu Tnr Phe Ser Asn Gly He Asn Leu Arc Ala 

in „, nryr GTG AAC AAG ATT TCT CTG CTA AGC ATT GCT GTT GGT CTT CCG AAC GTT GGC CCT CAT TTT ^|^G 
"30 Gly Val Asn Lys lie Ser Leu Leu Ser He Ala Val Gly Leu Pro Asn Val Gly Pro His Phe Glu Thr 

1962 TGG AAT GCT GGT GTT CTT GGA CCA GTT TCA CTT AAT GGA CTT AAT GAA GGA ACA AGA GAT TTA 
III Trp A^n All Gly Val Leu Gly Pro Val Ser Leu Asn Gly Leu Asn Glu Gly Thr Arc Asp Leu Thr Trp 

2031 CAG AAA TGG TTC TAC AAG GTT GGT CTA AAA GGA GAA GCC CTG AGT CTT CAT TCA CTC AGT »^ ^QB 

576 Gin Lys Trp Phe Tyr Lys val Gly Leu Lys Gly Glu Ala Leu Ser Leu His Ser Leu Ser Gly Ser Pro 598 

2100 TCC GTG GAG TGG GTG GAA GGC TCT TTA GTG GCT CAG AAG CAG CCA CTC AGT TGG TAT AAG ^^CT phe 
599 Ser V^ Glu Trp Val Glu Gly Ser Leu Val Ala Gin Lys Gin Pro Leu Ser Trp Tyr Lys Thr Thr Phe 

2169 AAT GCT CCA GAT GGA AAT GAA CCT TTG GCT TTA GAT ATG AAT ACC ATG GGC AAA Tro lie 

62 l Asn Ala Pro Asp Gly Asn Glu Pro Leu Ala Leu Asp Met Asn Thr Met Gly Lys Gly Gin Val Trp He 

223B AAT GGT CAG AGC CTC GGA CGC CAC TGG CCT GCA TAT AAA TCA TCT GGA AGT TGT AGT cvs Asn Tvr 

III Asn Gly Gin Ser Leu Gly Arg His Trp Pro Ala Tyr Lys Ser Ser Gly Ser Cys Ser Val Cys Asn Tyr 

■S££S = = = £S = ='= = 5S = S = S: = 5S = = 

2445 GGA ATC ACT TTA GTC AAA AGA GAA ATA GGG AGT GTT TGT GCT GAT ATA TAT GAG TGG p^A rMn r^ii 

"4 Gly iTe Thr Leu Val Lys Arg Glu He Gly Ser Val Cys Ala Asp He Tyr Glu Trp Gin Pro Gin Leu 

2652 TTC CAG CAG GGA AGC TGC CAT GCT CCG CGC TCA TAT GAT GCT TTC AAA AAG AAT v^T Glv Lvs Glu 

"3 S £n Gin Gly Ser Cys His Ala Pro Arg Ser Tyr Asp Ala Phe Lys Lys Asn Cys Val Gly Lys Glu 

2721 TCT TGC TCA GTA CAG GTA ACA CCA GAG AAT TTT GGA GGT GAT CCA TGT CGA AAC GTT CTA AAG AAA CTC 
III Ser J£ ™ val Gin Val Thr Pro Glu Asn Phe Gly Gly Asp Pro Cys Arg Asn Val Leu Lys Lys Leu 

2790 TCA GTG GAA GCC ATT TGT AGT TGA TGATTCTGAGTATACAAGTGAAAAAATACTTGAACCACTCA^ 
829 Ser Val Glu Ala He Cys Ser *** 
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Gone/clone name i TBG2 /pZBG2-l- 12 , acceeeion number AFX54420, Sequence ID number 2 



3 AGC AGA AGA AAA ACA CTG AAT TTT CCG TTA ATA CTA ACG GTG TTA ACT ATC CAC TTT GTG ATC GTC GCC 71 

1 Ser Arg Arg Lys Thr Leu Asn Phe Pro Leu lie Leu Thr Val Leu Thr lie His Phe Val lie Val Ala 23 

TTC AAG CCG TTC AAT GTC ACC TAC GAT AAC CGA GCT CTC ATC ATC GGC GGT AAA CGC CGT 140 



72 GGC GAG TAT 

24 Gly Glu Tyr Phe Lys Pro Phe Asn Val Thr Tyr Asp Asn Axg Ala Leu He He Gly Gly Lys Arg Arg 

141 ATG CTT ATC TCC GCC GGA ATT CAC TAC (XT CGC GCC ACT CCT GAG ATG TGG CCC ACA TTG ATA GCT AGG 
47 Met Leu He Ser Ala Gly He His Tyr Pro Arg Ala Thr Pro Glu Met Trp Pro Thr Leu He Ala Arg 

210 AGC AAA GAA GGT GGT GCA GAT GTC ATC GAG ACT TAT ACA TTT TGG AAT GGT CAT GAG CCA ACC AGG GGA 
70 Ser Lys Glu Gly Gly Ala Asp Val He Glu Thr Tyr Thr Phe Trp Asn Gly His Glu Pro Thr Arg Gly 

279 CAG TAC AAT TTT GAA GGA AGA TAT GAT ATT GTC AAG TTC GCA AAG CTA GTC GGA TCT CAT GGA CTG TTC 
93 Gin Tyr Asn Phe Glu Gly Arg Tyr Asp He Val Lys Phe Ala Lys Leu Val Gly Ser His Gly Leu Phe 

348 CTC TTT ATT CGA ATA GGT CCT TAT GCC TGT GCA GAA TGG AAC TTC GGG GGA TTC CCC ATA TGG CTT CGT 
116 Leu Phe He Arg He Gly Pro Tyr Ala Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro He Trp Leu Arg 

417 GAT ATA CCT GGA ATA GAA TTT CGA ACA GAT AAT GCA CCA TTC AAG GAG GAG ATG GAG CGC TAT GTT AAA 
139 Asp He Pro Gly He Glu Phe Arg Thr Asp Asn Ala Pro Phe Lys Glu Glu Met Glu Arg Tyr Val Lys 

486 AAG ATA GTT GAT CTT ATG ATA TCT GAG TCG CTC TTT TOG TGG CAA GGT GGT CCT ATC ATT TTG CTG CAG 
162 Lys He Val Asp Leu Met He Ser Glu Ser Leu Phe Ser Trp Gin Gly Gly Pro He He Leu Leu Gin 

555 ATT GAA AAT GAA TAT GGA AAT GTT GAA AGC TCA TTC GGT CCC AAG GGG AAG TTA TAT ATG AAA TGG GCT 
185 He Glu Asn Glu Tyr Gly Asn Val Glu Ser Ser Phe Gly Pro Lys Gly Lys Leu Tyr Met Lys Trp Ala 

624 GCT GAA ATG GCT GTT GGT CTT GGT GCT GGT GTT CCA TGG GTC ATG TGC AGG CAA ACT GAT GCT CCA G j AA 
208 Ala Glu Met Ala Val Gly Leu Gly Ala Gly Val Pro Trp Val Met Cys Arg Gin Thr Asp Ala Pro Glu 

693 TAC ATC ATA GAT ACT TGT AAT GCA TAC TAT TGT GAT GGG TTC ACG CCG AAT TCC GAG AAG AAA 

231 Tyr He He Asp Thr Cys Asn Ala Tyr Tyr Cys Asp Gly Phe Thr Pro Asn Ser Glu Lys Lys Pro Lys 

762 ATT TGG ACT GAG AAT TGG AAT GGA TGG TTT GCA GAT TGG GGT GAA AGA CTT CCA TAT AGA CCT ^CC 
254 He Trp Thr Glu Asn Trp Asn Gly Trp Phe Ala Asp Trp Gly Glu Arg Leu Pro Tyr Arg Pro Ser Glu 

831 GAT ATT GCA TTT GCA ATT GCT CGT TTC TTT CAA CGT GGG GGC AGC TTA CAG AAC TAT TAT ATG TAT TTT 
277 Asp He Ala Phe Ala He Ala Arg Phe Phe Gin Arg Gly Gly Ser Leu Gin Asn Tyr Tyr Met Tyr Phe 

900 GGT GGG ACA AAT TTT GGC CGG ACT GCT GGT GGC CCA ACT CAA ATC ACT AGC TAT GAT TAT GAT GCT CCA 
300 Gly Gly Thr Asn Phe Gly Arg Thr Ala Gly Gly Pro Thr Gin lie Thr Ser Tyr Asp Tyr Asp Ala Pro 

969 CTG GAT GAA TAT GGA CTA CTA CGT CAA CCT AAA TGG GGC CAT TTG AAG GAT CTG CAT GCT GCT ATA AAG 
323 Leu Asp Glu Tyr Gly Leu Leu Arg Gin Pro Lys Trp Gly His Leu Lys Asp Leu His Ala Ala He Lys 

1038 CTT TGT GAA CCA GCT CTT GTT GCT GCT GAT TCA CCT CAG TAT ATT AAA CTG GGA CCA AAA CAG GAG 
346 Leu Cys Glu Pro Ala Leu Val Ala Ala Asp Ser Pro Gin Tyr He Lys Leu Gly Pro Lys Gin Glu Ala 

1107 CAT GTC TAT CGT GGA ACA TCC AAC AAC ATT GGC CAA TAT ATG TCC TTA AAT GAA GGC ATA TGC GCA GCA 
369 His Val Tyr Arg Gly Thr Ser Asn Asn He Gly Gin Tyr Met Ser Leu Asn Glu Gly He Cys Ala Ala 

1176 TTT ATT GCA AAT ATT GAT GAA CAT GAA TCA GCA ACA GTG AAA TTT TAC GGT CAA GAG TTC ACT TTA CCT 
392 Phe He Ala Asn He Asp Glu His Glu Ser Ala Thr Val Lys Phe Tyr Gly Gin Glu Phe Thr Leu Pro 

1245 CCA TGG TCA GTG GTA TTC TGC CAG ATT GCA GAA ATA CAG CTT TCA ACA CAG CTA AGG TGG GGG CAC AAA 
415 Pro Trp Ser Val Val Phe Cys Gin He Ala Glu He Gin Leu Ser Thr Gin Leu Arg Trp Gly His Lys 

1314 CTT CAA TCA AAA CAG TGG GCT CAG ATT CTG TTT CAG TTG GGA ATA ATT CTT TGT TTC TAC AAG TTA TCA 
438 Leu Gin Ser Lys Gin Trp Ala Gin He Leu Phe Gin Leu Gly lie He Leu Cys Phe Tyr Lys Leu Ser 



46 



209 
69 



278 
92 



347 

115 



416 

138 



485 

161 



554 
184 



623 
207 



692 
230 



761 
253 



830 
276 



899 
299 



968 
322 



1037 
345 



1106 
368 



1175 
391 



1244 
414 



1313 
437 



1382 
460 
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1520 
506 

1589 
529 

1658 
552 

1727 
575 

1796 
598 

1865 
621 

1934 



2003 
667 

2072 



Figure 2 
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852 Gly He Ser Asn Gly Val Phe Gly Asp Pro Cys Arg His Val Val Lys Ser Leu Aia 
875 Cys Ser Pro Pro Pro Asp Leu Ser Thr Ser Ala Ser Ser 

2979 AAAAAA 
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2486 
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G eae/clone name: TBQ3/p2- Oc/bl; accession number AF154421, ^pence ID number 3 



AGAGTTCATTATTTTTTTTGCATTTTGAAA 30 
31 AAGAGGAAAAAAATAAAGTTAAJ^GGGGGGGGAAAAAGT^ 121 



122 ATG GGT TGT 
1 Met Gly Cys 


ACG 
Thr 


CTT 
Leu 


ATA 
He 


CTA 
Leu 


ATG 
Met 


TTG 
Leu 


AAT 
Asn 


GTG 
Val 


TTG 
Leu 


TTG 
Leu 


GTG 
Val 


TTG 
Leu 


TTG 

Leu 


GGT 
Gly 


TCA 

Ser 


TGG GTT 
Trp Val 


TTT 
Phe 


TCT 
Ser 


GGA 
Gly 


190 

23 


191 ACA GCT TCT 
24 Thr Ala Ser 


GTT 
Val 


TCA 
Ser 


TAT 
Tyr 


GAC 
Asp 


CAT 
His 


AGG 
Arg 


GCT 
Ala 


ATT 
He 


ATT 

He 


GTA 

Val 


AAT GGA 
Asn Gly 


CAA 
Gin 


AGA 
Arg 


AGA ATA CTT ATT 
Arg He Leu He 


TCT GGT 
Ser Gly 


259 
46 


260 TCT GTT CAT 
47 Ser Val His 


TAT 
Tyr 


CCA 
Pro 


AGA 
Arg 


AGC 
Ser 


ACT 
Thr 


CCT 
Pro 


GAG 
Glu 


ATG 
Met 


TGG 
Trp 


CCA 
Pro 


GGT 
Gly 


ATT 
He 


ATT 
He 


CAA AAG GCT AAA GAA GGA GGT 
Gin Lys Ala Lys Glu Gly Gly 


328 
69 


329 GTG GAT GTG 
70 Val Asp Val 


ATT 
He 


CAG 
Gin 


ACT 
Thr 


TAT 
Tyr 


GTT 
Val 


TTC 
Phe 


TGG 
Trp 


AAT 
Asn 


GGA 
Gly 


CAT 
His 


GAG 
Glu 


CCT 
Pro 


CAA 
Gin 


CAA GGG AAA TAT TAT 
Gin Gly Lys Tyr Tyr 


TTT 
Phe 


GAA 
Glu 


397 
92 


398 GGG AGA TAT 
93 Gly Arg Tyr 


GAT 
Asp 


TTA 
Leu 


GTG 
Val 


AAG 
Lys 


TTT 
Phe 


ATT AAG 
He Lys 


CTG 
Leu 


GTG 
Val 


CAC 
His 


CAA 
Gin 


GCA 
Ala 


GGA 
Gly 


CTT TAT GTC CAT 
Leu Tyr Val His 


CTT 
Leu 


AGA 
Arg 


GTT 
Val 


466 
115 


467 GGA CCT TAT 
116 Gly Pro Tyr 


GCT 
Ala 


TGT 
Cys 


GCT GAA TGG AAT TTT GGG 
Ala Glu Trp Asn Phe Gly 


GGC 
Gly 


TTT 
Phe 


CCT GTT 

Pro Val 


TGG 
Trp 


CTG 
Leu 


AAA TAT GTT 
Lys Tyr Val 


CCA GGT 
Pro Gly 


ATC 
He 


535 
138 


536 ACT TTC AGA 
139 Ser Phe Arg 


ACA 
Thr 


GAT 
Asp 


AAT GGA 
Asn Gly 


CCT 
Pro 


TTC 
Phe 


AAG 
Lys 


GCT 
Ala 


GCA 
Ala 


ATG 
Met 


CAA 
Gin 


AAA 
Lys 


TTT 
Phe 


ACT 
Thr 


GCC 
Ala 


AAG ATT 
Lys He 


GTC 
val 


AAT . ATG 
Asn Met 


604 
161 


605 ATG AAA GCG 
162 Met Lys Ala 


GAA 
Glu 


CGT 
Arg 


TTG 
Leu 


TAT 
Tyr 


GAA ACT CAA GGG 
Glu Thr Gin Gly 


GGG 
Gly 


CCA 
Pro 


ATA ATT 
He He 


TTA 
Leu 


TCT 
Ser 


CAG 
Gin 


ATT GAG 
He Glu 


AAT 
Asn 


GAA 
Glu 


TAT 
Tyr 


673 
184 


674 GGA CCC ATG 
185 Gly Pro Met 


GAA 
Glu 


TGG 
Trp 


GAA 
Glu 


CTG 
Leu 


GGA GCA CCA GGT 
Gly Ala Pro Gly 


AAA 

Lys 


TCT 
Ser 


TAC GCA 
Tyr Ala 


CAG 
Gin 


TGG 
Trp 


GCC 
Ala 


GCC AAA 
Ala Lys 


ATG 

Met 


GCT 
Ala 


GTG 
Val 


742 

207 


743 GGT CTT GAC ACT GGT GTC 
208 Gly Leu Asp Thr Gly Val 


CCA TGG GTT 
Pro Trp Val 


ATG 
Met 


TGC 
Cys 


AAG 
Lys 


CAA 
Gin 


GAC GAT 
Asp Asp 


GCC 
Ala 


CCT 
Pro 


GAT 
Asp 


CCT ATT 
Pro He 


ATA AAT 
He Asn 


GCT 
Ala 


811 
230 


812 TGC AAT GGC 
231 Cys Asn Gly 


TTC 
Phe 


TAC 
Tyr 


TGT 
Cys 


GAC 
Asp 


TAC 
Tyr 


TTT 
Phe 


TCT 
Ser 


CCA 
Pro 


AAC 
Asn 


AAG 
Lys 


GCT TAT 
Ala Tyr 


AAA 
Lys 


CCA 
Pro 


AAG 
Lys 


ATA TGG ACT 
He Trp Thr 


GAA. 
Glu 


GCC 
Ala 


880 
253 


881 TOG ACT GCA 
254 Trp Thr Ala 


TGG 
Trp 


TTT ACT GGT 
Phe Thr Gly 


TTT GGA AAT 
Phe Gly Asn 


CCA 
Pro 


GTT 
Val 


CCT 
Pro 


TAC CGT 
Tyr Arg 


CCT 
Pro 


GCT 
Ala 


GAG 
Glu 


GAC TTG 
Asp Leu 


GCA 
Ala 


TTT 
Phe 


TCT 
Ser 


949 
276 


950 GTT GCA AAA 
277 Val Ala Lys 


TTT 
Phe 


ATA 

He 


CAG 
Gin 


AAG 
Lys 


GGA GGT TCC 
Gly Gly Ser 


TTC 
Phe 


ATC 
He 


AAT 
Asn 


TAT TAC 
Tyr Tyr 


ATG 
Met 


TAT 
Tyr 


CAT GGA GGA ACA 
His Gly Gly Thr 


AAC 
Asn 


TTT 
Phe 


1018 
299 


1019 GGA CGG ACT 
300 Gly Arg Thr 


GCT GGT GGT 
Ala Gly Gly 


CCA 
Pro 


TTT 

Phe 


ATT 
He 


GCT 
Ala 


ACT 
Thr 


AGT 
Ser 


TAT 
Tyr 


GAC 
Asp 


TAT 

Tyr 


GAT 
Asp 


GCA 
Ala 


CCA 
Pro 


CTT GAT 
Leu Asp 


GAA TAT GGA 
Glu Tyr Gly 


1087 
322 


1088 TTA TTG CGA 
323 Leu Leu Arg 


CAA 

Gin 


CCA 
Pro 


AAA 

Lys 


TGG GGT CAC 
Trp Gly His 


CTG 
Leu 


AAA 
Lys 


GAT 
Asp 


CTG 
Leu 


CAT 
His 


AGA 
Arg 


GCA 
Ala 


ATA 
He 


AAG 
Lys 


CTT TGT 
Leu Cys 


GAA 
Glu 


CCA 
Pro 


GCT 
Ala 


1156 
345 


1157 TTA GTC TCT 
346 Leu Val Ser 


GGA GAT 
Gly Asp 


CCA 
Pro 


GCT 
Ala 


GTG 
Val 


ACA 
Thr 


GCA 
Ala 


CTT 
Leu 


GGA 
Gly 


CAC 
His 


CAG 
Gin 


CAG 
Gin 


GAG 
Glu 


GCC 
Ala 


CAT 
His 


GTT TTT 
Val Phe 


AGG 
Arg 


TCC 
Ser 


AAG 
Lys 


1225 
368 


1226 GOT GGC TCT 
369 Ala Gly Ser 


TGT 
Cys 


GCT 
Ala 


GCA 
Ala 


TTC 
Phe 


CTT 
Leu 


GCT AAC 
Ala Asn 


TAC 
Tyr 


GAC 
Asp 


CAA 
Gin 


CAC 
His 


TCT 
Ser 


TTT 
Phe 


GCT 
Ala 


ACT 
Thr 


GTG TCA 
Val Ser 


TTT 

Phe 


GCA 
Ala 


AAC 
Asn 


1294 
391 


1295 AGG CAT TAC 
392 Arg His Tyr 


AAC 
Asn 


TTG 
Leu 


CCA 
Pro 


CCA 
Pro 


TGG 
Trp 


TCA 
Ser 


ATC 
He 


AGC 
Ser 


ATT 
He 


CTT 
Leu 


CCC 
Pro 


GAC 
Asp 


TGC 
Cys 


AAG 
Lys 


AAC 
Asn 


ACT GTA 
Thr Val 


TTT 
Phe 


AAT 
Asn 


ACA 
Thr 


1363 
414 


1364 GCA CGG ATC 
415 Ala Arg He 


GGT GCT 
Gly Ala 


CAA 
Gin 


AGT 
Ser 


GCT 
Ala 


CAG 
Gin 


ATG 
Met 


AAG 
Lys 


ATG 
Met 


ACT 
Thr 


CCA 
Pro 


GTC 
Val 


AGC 
Ser 


AGA 
Arg 


GGA 
Gly 


TTG CCC 
Leu Pro 


TGG 
Trp 


CAG 
Gin 


TCA 
Ser 


1432 
437 


1433 TTC AAT GAA 
438 Phe Asn Glu 


GAG 
Glu 


ACA 
Thr 


TCA 
Ser 


TCT 
ser 


TAT 
Tyr 


GAA 
Glu 


GAC 
Asp 


AGT 
Ser 


AGT 
Ser 


TTT 
Phe 


ACA 
Thr 


GTT 
Val 


GTT 
Val 


GGG 
Gly 


CTA 
Leu 


TTG GAA 
Leu Glu 


CAG 
Gin 


ATA 
He 


AAT 
Asn 


1501 
460 
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Oone/clooe name: TBa3/p2-l-:^l>l ; acceeeion number XT1S4421, Sa^ce ID numW "3 coat. 

1502 ACA ACA AGA GAC GTG TCT GAT TAT TTG TGG TAT TCA ACA GAT GTC AAG ATT GAT TCA AGA GAA AAG 1570 

461 Thr Thr Arg Asp Val Ser Asp Tyr Leu Trp Tyr Ser Thr Asp Val Lys He Asp Ser Arg Glu Lys Phe 483 

1571 TTG AGA GGC GGA AAA TGG CCT TGG CTT ACG ATC ATG TCA GCT GGG CAT GCA TTG CAT GTT TTT GTG AAT 1639 

484 Leu Arg Gly Gly Lys Trp Pro Trp Leu Thr He Met Ser Ala Gly His Ala Leu His Val Phe Val Asn 506 

1640 GGT CAA TTA GCA GGA ACT GCA TAT GGA ACT TTA GAA AAA COG AAA CTA ACT TTC AGT AAA GCC GTA AAT 
507 Gly Gin Leu Ala Gly Thr Ala Tyr Gly Ser Leu Glu Lys Pro Lys Leu Thr Phe Ser Lys Ala Val Asn 

1709 CTG AGA GCA GGT GTT AAC AAG ATT TCT CTA CTG AGC ATT GCT GTT GGC CTT CCG AAT ATC GGC CCA CAT 1777 

530 Leu Arg Ala Gly Val Asn Lys He Ser Leu Leu Ser He Ala Val Gly Leu Pro Asn He Gly Pro His 552 

1778 TTT GAG ACA TGG AAT GCT GGT GTT CTT GGG CCA GTC TCA CTA ACT GGT CTT GAC GAG GGG AAA AGA GAT 1846 

553 Phe Glu Thr Trp Asn Ala Gly Val Leu Gly Pro Val Ser Leu Thr Gly Leu Asp Glu Gly Lys Arg Asp 575 

1847 TTA ACA TGG GAG AAA TGG TCT TAC AAG GTT GGT CTA AAA GGA GAA GCC TTG AGC CTC CAT TCA CTC AGT 1915 



Trp ser Tyr Lys Val Gly Leu Lys Gly Glu Ala Leu Ser Leu His Ser Leu Ser 



1708 
529 



576 Leu Thr Trp Gin Lys 

1916 GGT AGC TCG TCA GTT GAG TGG GTC GAG GGT TCT TTA GTG GCT CAG AGA CAG CCA CTC ACA TGG TAC AAG 1984 
599 Gly Ser Ser Ser Val Glu Trp Val Glu Gly Ser Leu Val Ala Gin Arg Gin Pro Leu Thr Trp Tyr Lys 621 

1985 AGC ACT TTT AAT GCT CCA GCT GGA AAT GAT CCT TTG GCT TTA GAC TTG AAT ACC ATG GGC AAA 2 °" 
622 Ser Thr Phe Asn Ala Pro Ala Gly Asn Asp Pro Leu Ala Leu Asp Leu Asn Thr Met Gly Lys Gly Gin 644 

2054 GTG TGG ATA AAT GGT CAA AGC CTC GGA CGC TAT TGG CCT GGA TAT AAA GCA TCT GGT AAC TGC 
645 Val Trp He Asn Gly Gin Ser Leu Gly Arg Tyr Trp Pro Gly Tyr Lys Ala Ser Gly Asn Cys Gly Ala 

2123 TGT AAC TAT GCA GGC TGG TTT AAT GAG AAA AAA TGC CTA AGT AAC TGT GGA GAG GCT TCA CAA CGA TGG 
668 Cys Asn Tyr Ala Gly Trp Phe Asn Glu Lys Lys Cys Leu Ser Asn Cys Gly Glu Ala Ser Gin Arg Trp 

2192 TAT CAT GTT CCC CGT TCT TGG CTG TAT CCT ACT GGA AAT TTG TTAGTTCTATTTGAGGAATGGGCAGGA 
691 Tyr His Val Pro Arg Ser Trp Leu Tyr Pro Thr Gly Asn Leu Leu Val Leu Phe Glu Glu Trp Gly Gly 

2261 GAG CCT CAT GGA ATC TCT TTG GTA AAA AGA GAA GTT GCA AGT GTT TGT GCA GAT ATA AAC GAA TGG CAA 
714 Glu Pro His Gly He Ser Leu Val Lys Arg Glu Val Ala Ser Val Cys Ala Asp He Asn Glu Trp Gin 

2330 CCA CAG TTG GTG AAT TGG CAA ATG CAA GCA TCT GGT AAA GTT GAC AAA CCA CTG AGA AAA 
737 Pro Gin Leu Val Asn Trp Gin Met Gin Ala Ser Gly Lys Val Asp Lys Pro Leu Arg Pro Lys Ala Has 

2399 CTC TCG TGT GCT TCT GGT CAG AAG ATT ACT TCA ATC AAA TTT GCA AGC TTT GGA ^pA r^v Val ^lH 

760 Leu Ser Cys Ala Ser Gly Gin Lys He Thr Ser He Lys Phe Ala Ser Phe Gly Thr Pro Gin Gly Val 782 

2468 TGC GGA AGC TTC CGT GAA GGA AGC TGC CAC GCC TTC CAC TCA TAT GAT GCT TTT GAA AGG TAT 
783 Cys Gly Ser Phe Arg Glu Gly Ser Cys His Ala Phe His Ser Tyr Asp Ala Phe Glu Arg Tyr Cys lie 

2537 GGG CAA AAC TCG TGC TCA GTA CCT GTA ACA CCA GAG ATC TTT GGA GGT GAT CCA TGT v/^T M«=«r 

806 Gly Gin Asn Ser Cys Ser Val Pro Val Thr Pro Glu He Phe Gly Gly Asp Pro Cys Pro Has Val Met 

2606 AAG AAA CTC TCA GTT GAG GTT ATT TGC AGT TGA TGACACTGAGGAGAAACAAATAAAAGTGGTT^ 2686 
829 Lys Lys Leu Ser Val Glu Val He Cys Ser *-* 

2687 CATATCAAAAAGTTGGCTTTGATGGAG Vll 

2779 GATTCTGTACATATATGTTGGTTAC ^ 

2871 ATTGTGCTAGTGGGAGGTAGTAGGTACCGATCGATCT V>™ 

2963 ACAATGAGACTGATTCTTGATTAGTCCATGTCT \f s \ 
3055 AAAAAAAAAAAAAAA 



2122 
667. 



2191 
690 



2260 
713 



2329 
736 



2398 
759 



2536 
805 



2605 
828 



WO 99/64564 



PCT7US99/12697 



8 / 31 



Figure 2 

Sheet 7 of 12 

eM /cIon. name: TBG4 /pZBG2 - ^ pTomps,«14, accession number APO2 0 3^T Stance ID number 4 



64 



AAAAAAAGTTTCAA rri^Vri 'rcTAAAATAA 63 

ATG CTA AGG ACT AAT GTG TTG TTG TTA TTA GTT ATT TGT TTA TTG GAT TTT TTT TCT TCA GTG AAA GCT 132 

23 



1 Met Leu Arci Thr Asn Val Leu Leu Leu Leu Val lie Cys Leu Leu Asp Phe Phe Ser Ser Val Lys Ala 

133 AGT GTT TCT TAT GAT GAC AGA GCT ATA ATC ATA AAT GGG AAA AGA AAA ATT CTT ATT TCT GGT TCA ATT 
24 Ser Val Ser Tyr Asp Asp Arg Ala He He He Asn Gly Lys Arg Lys He Leu He Ser Gly Ser He 



Tyr Asp Leu Val Arg Phe He Lys Met Val Gin Arg Ala Gly Leu Tyr Val Asn Leu Arg He Gly Pro 115 



202 - - ~ 

47 Hie Tyr Pro Arg Ser Thr Pro Gin Met Trp Pro Asp Leu He Gin Lys Ala Lys Asp Gly Gly Leu Asp 

271 GTT ATT GAA ACT TAT GTT TTC TGG AAT GGA CAT GAG CCT TCT CCT GGA AAA TAT AAT TTT GAA GGA AGA 
70 Val He Glu Thr Tyr Val Phe Trp Asn Gly His Glu Pro Ser Pro Gly Lys Tyr Asn Phe Glu Gly Arg 

340 TAT GAT CTT GTT AGA TTC ATC AAA ATG GTA CAA AGA GCA GGA CTT TAT GTC AAT TTA CGT ATT GGC : CCT 
93 

409 TAC GTC TGT GCT GAA TGG AAC TTT GGG GGA TTC CCT GTT TGG CTA AAA TAT GTG CCT GGT ATG GAA TTT 
116 Tyr Val Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro Gly Met Glu Phe 

478 AGA ACA AAC AAT CAG CCT TTT AAG GTG GCT ATG CAA GGA TTT GTT CAG AAA ATA GTC AAC ATG ATG AAG 
139 Arg Thr Asn Asn Gin Pro Phe Lys Val Ala Met Gin Gly Phe Val Gin Lys He Val Asn Met Met Lys 

547 TCA GAA AAT TTG TTT GAA TCT CAA GGA GGA CCA ATA ATT ATG GCC CAG ATA GAA AAT GAG TAT GGA CCA 
162 Ser Glu Asn Leu Phe Glu Ser. Gin Gly Gly Pro He He Met Ala Gin He Glu Asn Glu Tyr Gly Pro 

616 GTA GAA TGG GAA ATT GGT GCT CCT GGT AAA GCT TAT ACA AAA TGG GCA GCT CAA ATG GCT GTA GGT TTG 
185 val Glu Trp Glu He Gly Ala Pro Gly Lys Ala Tyr Thr Lys Trp Ala Ala Gin Met Ala Val Gly Leu 



gQtj 

208 Lys Thr Gly Val Pro Trp lie Met Cys Lys Gin Glu Asp Ala Pro Asp Pro Val He Asp Thr Cys Asn 

754 GGC TTC TAC TGC GAA GGG TTC CGT CCT AAT AAG CCT TAC AAA CCT AAA ATG TGG ACA GAA GTA TGG ACT 
231 Gly Phe Tyr Cys Glu Gly Phe Arg Pro Asn Lys Pro Tyr Lys Pro Lys Met Trp Thr Glu Val Trp Thr 

823 GGC TGG TAT ACG AAA TTC GGT GGT CCA ATT CCT CAA AGA CCA GCC GAA GAC ATT GCA TTT TCA GTT GCC 
254 Gly Trp Tyr Thr Lys Phe Gly Gly Pro He Pro Gin Arg Pro Ala Glu Asp He Ala Phe Ser Val Ala 

892 AGG TTT GTT CAG AAC AAT GGT TCA TTC TTC AAT TAC TAC ATG TAT CAT GGA GGA ACA AAT TTT GGC CGG 
277 Arg Phe Val. Gin Asn Asn Gly Ser Phe Phe Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg 

961 ACA TCA TCA GGG CTT TTC ATT GCA ACT AGC TAC GAT TAT GAT GCT CCT CTC GAT GAA TAT GGG TTG CTG 
300 Thr Ser Ser Gly Leu Phe He Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr Gly Leu Leu 

1030 AAT GAA CCA AAG TAT GGG CAC TTG AGA GAC TTA CAT AAA GCT ATC AAG CTA TCT GAA CCG GCT TTA GTT 
323 Asn Glu Pro Lys Tyr Gly His Leu Arg Asp Leu His Lys Ala He Lys Leu Ser Glu Pro Ala Leu Val 

1099 TCA TCA TAT GCT GCG GTG ACT AGT CTT GGA AGT AAT CAA GAG GCT CAT GTT TAT AGA TCA AAA TCT GGA 
346 Ser Ser Tyr Ala Ala Val Thr Ser Leu Gly Ser Asn Gin Glu Ala His Val Tyr Arg Ser Lys Ser Gly 

1168 GCT TGT GCT GCT TTT TTA TCC AAC TAT GAC TCT AGA TAT TCA GTA AAA GTC ACC TTT CAG AAT AGG CCA 
369 Ala Cys Ala Ala Phe Leu Ser Asn Tyr Asp Ser Arg Tyr Ser Val Lys Val Thr Phe Gin Asn Arg Pro 

1237 TAC AAT CTG CCT CCA TGG TCC ATC AGC ATT CTT CCC GAC TGC AAA ACT GCC GTT TAC AAC ACT GCA CAG 
392 Tyr Asn Leu Pro Pro Trp Ser He Ser He Leu Pro Asp Cys Lys Thr Ala Val Tyr Asn Thr Ala Gin 



1306 



415 Val Asn Ser Gin Ser Ser Ser He Lys Met Thr Pro Ala Gly Gly Gly Leu Ser Trp Gin Ser Tyr Asn 

1375 GAA GAA ACG CCT ACT GCT GAT GAC AGC GAT ACA CTT ACA GCT AAC GGA CTA TGG GAA CAG AAA AAC GTC 
438 Glu Glu Thr Pro Thr Ala Asp Asp Ser Asp Thr Leu Thr Ala Asn Gly Leu Trp Glu Gin Lys Asn Val 



201 
46 



CAT TAT CCA AGA AGC ACT CCA CAG ATG TGG CCT GAT CTT ATA CAA AAG GCT AAA GAT GGA GGC TTA GAT 270 



69 



339 
92 



408 



477 
138 



546 

161 



615 
184 



684 

207 



AAA ACT GGT GTC CCA TGG ATC ATG TGT AAG CAA GAG GAT GCT CCT GAT CCT GTG ATT GAT ACT TGT AAT 753 



230 



822 
253 



891 
276 



960 
299 



1029 
322 



1098 
345 



1167 
368 



1236 
391 



1305 
414 



GTT AAC TCT CAA AGC TCG AGC ATA AAG ATG ACG CCT GCA GGT GGT GGA TTG TCT TGG CAG TCA TAC AAT 1374 



437 



1443 
460 
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461 



Thr Arg Asp Ser Sex Asp lyr Leu Trp Tyr Met Thr Asn Val Asn He Ala Ser Asn Glu Gly Phe Leu 



53 



0 Arg Ala Gly He Asn Lys He Ser Leu Leu Ser Val Ser Val Gly Leu Pro Asn Val Gly Val His Tyr 



553 Asp Thr Trp Asn 



cone. 



1444 ACA AGA GAT TCA TCA GAC TAT CTG TGG TAG ATG ACA AAT GTA AAT ATA GCA TCT AAT GAA GGA TTT CTA 1512 



483 



1513 AAG AAC GGA AAG GAT CCT TAT CTC ACT GTT ATG TCC GCT GGT CAT GTC TTG CAT GTT TTC GTC AAT GGA 1581 



506 



484 Lys Asn Gly Lys Asp Pro Tyr Leu Thr Val Met Ser Ala Gly His Val Leu His Val Phe Val Asn Gly 

1582 AAA CTA TCA GGA ACT GTT TAT GGT ACA TTG GAT AAT CCA AAA CTT ACA TAC AGT GGC AAC -GTG AAG TTA 
507 Lys Leu Ser Gly Thr Val Tyr Gly Thr Leu Asp Asn Pro Lys Leu Thr Tyr Ser Gly Asn Val Lys Leu 

1651 AGA GCT GGT ATT AAC AAG ATT TCT CTG CTC AGT GTT TCC GTT GGT CTC COG AAC GTT GGC GTG CAT TAT 1719 



1650 
529 



552 



1720 GAT ACA TGG AAT GCA GGA GTT CTA GGT CCA GTC ACG TTG AGC GGT CTC AAT GAA GGG TCA AGA AAC TTG 1788 



575 



Ala Gly Val Leu Gly Pro Val Thr Leu Ser Gly Leu Asn Glu Gly Ser Arg Asn Leu 

17B9 GCG AAA GAG AAA TGG TCT TAC AAG GTT GGT CTG AAA GGC GAA TOG TTA AGT CTT CAC TCC TTA AGT GGG 
576 Ala Lys Gin Lys Trp Ser Tyr Lys Val Gly Leu Lys Gly Glu Ser Leu Ser Leu His Ser Leu Ser Gly 

1858 AGT TCT TCT GTT GAA TGG GTT CGA GGT TCA CTA ATG GCT CAA AAG CAG CCC CTG ACT TGG TAC AAG GCT 
599 Ser Ser Ser Val Glu Trp Val Arg Gly Ser Leu Met Ala Gin Lys Gin Pro Leu Thr Trp Tyr Lys Ala 

1927 ACA TTT AAC GCG CCT GGA GGA AAT GAT CCA CTA GCT TTA GAC ATG GCA AGT ATG GGA AAA GGT CAG ATA 1995 
622 Thr Phe Asn Ala Pro Gly Gly Asn Asp Pro Leu Ala Leu Asp Met Ala Ser Met Gly Lys Gly Gin He 644 

1996 TGG ATA AAT GGT GAA GGC GTA GGT CGC CAT TGG CCT GGA TAC ATA GCA CAA GGC GAC TGC AGC AAA TGC 
645 Trp He Asn Gly Glu Gly Val Gly Arg His Trp Pro Gly Tyr lie Ala Gin Gly Asp Cys Ser Lys Cys 

2065 AGT TAT GCT GGA ACG TTC AAC GAG AAG AAG TGC CAG ACT AAC TGC GGA CAA CCT TCT CAG AGA TGG TAC 2133 

_____ _ — - - *- 690 



1857 
598 



1926 
621 



2064 
667 



2202 
713 



668 Ser Tyr Ala Gly Thr Phe Asn Glu Lys Lys Cys Gin Thr Asn Cys Gly Gin Pro Ser Gin Arg Trp Tyr 

2134 CAT GTT CCA CGA TOG TGG CTG AAA CCA AGT GGA AAC TTG TTA GTA . GTA TTC GAA GAA TGG GGA GGT AAT 
691 His Val Pro Arg Ser Trp Leu Lys Pro Ser Gly Asn Leu Leu Val Val Phe Glu Glu Trp Gly Gly Asn 

2203 CCA ACA GGA ATT TCT CTA GTC AGG AGA TCA AGA TAA AGAACTCGAAAAGTAAAACTTGTTCAGTAACTATG 2282 
714 Pro Thr Gly He Ser Leu Val Arg Arg Ser Arg *** 725 

2283 TTCGCGCCGAAAAATACATACACGAAGCTAAC^ 2374 
2375 TGATTAAAAGGAGTATATAAATTTACA^ 24 f* 
2467 GGATTATGAGATTGAAGJ-AGATTGTACAGCTTC 2554 
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1 ATC CAG ACT TAC GTT TTC TOG AAC CTT CAT GAA CCT GTT CGA AAT CAG TAT GAT TTT GAA GGA AGG AAA 
1 lie Gin Thr Tyr Val Phe Trp Asn Leu His Glu Pro Val Arg Asn Gin Tyr Asp Phe Glu Gly Arg Lys 

70 GAT TTG ATT- AAT TTT GTG AAG TTG GTG GAG ASA GCT GGC TTA TTT GTT CAT ATA AGG ATT GGG CCT TAT 
24 Asp Leu lie Asn Phe Val Lys Leu Val Glu Arg Ala Gly Leu Phe Val His He Arg lie Gly Pro Tyr 

139 GTT TGT GCA GAA TGG AAC TAT GGT GGG TTT CCT CTT TGG TTG CAT TTC ATT CCT GGA ATT GAA TTT CGA 
47 Val Cys Ala Glu Trp Asn Tyr Gly Gly Phe Pro Leu Trp Leu His Phe He Pro Gly He Glu Phe Arg 

208 ACC GAC AAT GAA CCG TTC AAG GCA GAA ATG AAG CGA TTC ACA GCT AAA ATT GTT GAC ATG ATC AAG CAA 
70 Thr Asp Asn Glu Pro Phe Lys Ala Glu Met Lys Arg Phe Thr Ala Lys He Val Asp Met He Lys Gin 

277 GAA AAT CTA TAT GCA TCC CAG GGT GGG CCG GTT ATC TTG TCT CAG ATA GAA AAT GAG TAT GGC AAT GGT 
93 Glu Asn Leu Tyr Ala Ser Gin Gly Gly Pro Val He Leu Ser Gin He Glu Asn Glu Tyr Gly Asn Gly 

346 GAT ATT GAG TCT CGT TAT GGT CCT CGT GCC AAA CCT TAC GTG AAC TGG GCA GCA TCA ATG GCT AGG TCT 
116 Asp He Glu Ser Arg Tyr Gly Pro Arg Ala Lys Pro Tyr Val Asn Trp Ala Ala Ser Met Ala Thr Ser 

415 TTA AAT ACG GGA GTG CCA TGG GTT ATG TGT CAG CAA CCA GAT GCC CCT CCT TCC GTT ATT AAC ACT TGC 
139 Leu Asn Thr Gly Val Pro Trp Val Met Cys Gin Gin Pro Asp Ala Pro Pro Ser Val He Asn Thr Cys 

484 AAT GGA TTT TAT TGT GAC CAA TTC AAG CAA AAT TCC GAT AAA ACA CCC AAG ATG TGG ACT GAG AAT TGG 
162 Asn Gly Phe Tyr Cys Asp Gin Phe Lys Gin Asn Ser Asp Lys Thr Pro Lys Met Trp Thr Glu Asn Trp 

553 ACC GGA TGG TTT CTG TOG TTT GGT GGT CCT GTC CCT TAC AGA CCA GTG GAA GAC ATC GCT TTC GCT GTG 
185 Thr Gly Trp Phe Leu Ser Phe Gly Gly Pro Val Pro Tyr Arg Pro Val Glu Asp He Ala Phe Ala Val 

622 GCT CGA TTT TTC CAG CGA GGC GGA ACT TTC CAG AAC TAT TAC ATG TAC CAC GGG GGA ACT AAC TTT GGG 
208 Ala Arg Phe Phe Gin Arg Gly Gly Thr Phe Glu Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly 

691 AGA ACC ACT GGT GGA CCG TTT ATT GCA ACT AGC TAT GAC TAT GAT GCC CCT CTC GAC GAA TAC GG 
231 Arg Thr Ser Gly Gly Pro Phe lie Ala Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr 



69 
23 

138 
46 

207 
69 

276 
92 

345 
115 

414 

138 

483 
161 

552 
184 

621 
207 

690 
230 

755 
252 
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1 ATC CAG ACA TAT Gn ?TT TCG AAT GTT CAT GAG OCT TCT CCT GGC AAT TAC AAT TTT GAA GGA AGA TAT 69 
1 lie Gin Thr Tyr Val Phe Trp Asn Val His Glu Pro Ser Pro Gly Asn Tyr Asn Phe Glu Gly Arg Tyr 23 



70 GAC CTG GTG AGG TTT GTA AAA ACG ATT CAG AAA GCA GGG CTG TAT GCT CAT CTT CGA ATT GGC CCT TAC 
24 Asp Leu Val Arg Phe Val Lys Thr He Gin Lys Ala Gly Leu Tyr Ala His Leu Arg He. Gly Pro Tyr 

139 GTT TGT GCA GAG TGG AAT TTT GGA GGG TTT CCA GTA TOG CTG AAG TAT GTA CCT GGC ATT AGC TTC AGA 
47 Val Cys Ala Glu Trp Asn Phe Gly Gly Phe Pro Val Trp Leu Lys Tyr Val Pro Gly He Ser Phe Arg 



208 



622 TTT ATA CAA AGA GGA GGA TCT TTT GTT AAC TAT TAC ATG TAC CAT GGG GGC ACG AAC TTT GGA CGC ACT 
208 Phe He Gin Arg Gly Gly Ser Phe Val Asn Tyr Tyr Met Tyr His Gly Gly Thr Asn Phe Gly Arg Thr 

691 GCG GGT GGG CCA TTC ATC ACT ACC AGC TAT GAT. TAT GAT GCC CCC CTC GAC GAG TAT GG 
231 Ala Gly Gly Pro Phe He Thr Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Asp Glu Tyr 



138 
46 



207 
69 



GCT GAT AAT GAA CCT TTC AAG AAC GCA ATG AAA GGG TAT GCT GAG AAA ATT GTT AAC TTG ATG AAG ATC 276 



92 



70 Ala Asp Asn Glu Pro Phe Lys Asn Ala Met Lys Gly Tyr Ala Glu Lys He Val Asn Leu Met Lys He 
277 ATA ATC TTT TOG AGT CTC AGG GTG GTC CAA TCA TAC TCT CAC AGA TTG AGA ATG AGT ATG GGC CTC AAG 345 



115 



93 He He Phe Ser Ser Leu Arg Val Val Gin Ser Tyr Ser His Arg Leu Arg Met Ser Met Gly Leu Lys 

346 CCA AGG TAC TTG GAG CAC CGG GAC ATC AGT ATT CAA CAT GGG CTG CAA ATA TGG CAG TTG GAT TTG AAC 
116 Pro Arg Tyr Leu Glu His Arg Asp He Ser He Gin His Gly Leu Gin He Trp Gin Leu Asp Leu Asn 

415 ACA GGC GTC CCA TGG GTG ATG TGC AAG GAA GAA GAT GCA CCA GAT CCT GTG ATC AAC ACA TGC AAT GGT 483 
139 Thr Gly Val Pro Trp Val Met Cys Lys Glu Glu Asp Ala Pro Asp Pro Val He Asn Thr Cys Asn Gly 161 

484 TTC TAC TGT GAT AAT TTC TTC CCA AAC AAA CCA TAC AAA CCT GCA ATT TGG ACT GAA GCT TGG AGT GGA 
162 Phe Tyr Cys Asp Asn Phe Phe Pro Asn Lys Pro Tyr Lys Pro Ala He Trp Thr Glu Ala Trp Ser Gly 

553 TGG TTC TCG GAA TTT GGC GGT CCC CTT CAT CAG AGA CCA GTT CAG GAT TTG GCA TTT GCT GTT GCC CAA 
185 Trp Phe Ser Glu Phe Gly Gly Pro Leu His Gin Arg Pro Val Gin Asp Leu Ala Phe Ala Val Ala Gin 207 

690 
230 



414 
138 



552 
184 



621 



749 
250 
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x GCAAcwcrrci* <~ w 12 

13 CTGAATAACACCGGTAAACGGCCAAT^ 103 



104 
1 


ATG 
Met 


AAC 
Asn 


ACA 
Thr 


ATG 
Met 


AGT 
Ser 


TGT 
Cys 


TTG 
Leu 


TCC 
Ser 


TCT 
Ser 


AAT TTC AAG TTC GTT TTC CTT 
Asn Phe Lys Phe Val Phe Leu 


GCC 
Ala 


TCG 
Ser 


ACT GTG 
Thr Val 


ATA 
He 


TGG 
Trp 


ATG 
Met 


172 
23 


173 
24 


ACG 
Thr 


GTA 
Val 


ATG 
Met 


TCG 
Ser 


TCG 
Ser 


TCG 
Ser 


TTA 
Leu 


GCA 
Ala 


GCA 
Ala 


GTA GAT GCT TCC AAT GTT ACT 
Val Asp Ala Ser Asn Val Thr 


ACT 
Thr 


ATT 
He 


GGT ACT 
Gly Thr 


GAT 
Asp 


AGT 
Ser 


GTG 
Val 


241 
46 


242 
47 


Thr 


TAC 
Tyr 


GAT 
Asp 


CGA 
Arg 


CGC 
Arg 


TCG 

Ser 


TTG 
Leu 


ATT 
lie 


ATT AAC GGC CAG AGG AAG CTG CTC 
He Asn Gly Gin Arg Lys Leu Leu 


ATC 
He 


TCC 
Ser 


GCT TCC 
Ala Ser 


ATT 
He 


CAC 
His 


TAT 
Tyr 


310 
69 


311 
70 


CCT 
Pro 


Arg 


Ser 


GTC 
Val 


CCT 
Pro 


GCC 
Ala 


ATG 
Met 


TGG 
Trp 


CCT GGT CTG GTT CGA TTG GCG AAG GAA GGA 
Pro Gly Leu Val Arg Leu Ala Lys Glu Gly 


GGA GTG 
Gly Val 


GAT 
Asp 


GTT 
Val 


ATT 
He 


379 
92 


380 
93 


GAA 
Glu 


ACG 
Thr 


TAT 
Tyr 


GTT 
Val 


Phe 


TGG 
Trp 


AAC 
Asn 


GGT 
Gly 


CAC 
His 


GAA CCT TCT COG GGC AAT TAT TAC 
Glu Pro Ser Pro Gly Asn Tyr Tyr 


TTT 
Phe 


GGA GGA 
Gly Gly 


AGG 
Arg 


TTT 
Phe 


GAT 
Asp 


446 

115 


449 
116 


CTA 
Leu 


GTC 
Val 


AAA 
Lys 


TTT 
Phe 


TGT 
Cys 


AAG 
Lys 


ATC 
He 


ATT 
He 


CAG CAG GCT GGA ATG TAT ATG ATT 
Gin Gin Ala Gly Met Tyr Met He 


CTT 
Leu 


COG 
Arg 


ATT GGA 
He Gly 


CCA 
Pro 


TTT 
Phe 


GTA 
val 


517 
138 


516 
139 


GCT 
Ala 


GCA 
Ala 


GAA 
Glu 


TGG 
Trp 


AAC 
Asn 


TTT 
Phe 


GGT 
Gly 


GGA 
Gly 


CTT 
Leu 


CCT GTG TGG TTG CAT TAT GTG 
Pro Val Trp Leu His Tyr Val 


CCA GGT 
Pro Gly 


ACC ACC 
Thr Thr 


TTT 
Phe 


GGG 
Arg 


ACT 
Thr 


586 
161 


587 
162 


GAT 
Asp 


Sex 


GAA 
Glu 


Pro 


Phe 


AAG 
Lys 


TAT 
Tyr 


CAC 
His 


ATG 
Met 


CAG AAG TTC ATG ACA TAT ACA GTG 
Gin Lys Phe Met Thr Tyr Thr Val 


AAC 
Asn 


TTA ATG 
Leu Met 


AAG 
Lys 


AGA 
Arg 


GAG 
Glu 


655 
184 


656 
165 


AGG 
Arg 


Leu 


Phe 


GCA 
Ala 


Ser 


CAA 
Gin 


GGA 
Gly 


GGT 
Gly 


CCA 
Pro 


ATC ATC TTG TCA CAG GTA GAA 
He He Leu Ser Gin Val Glu 


AAT 
Asn 


GAG 

Glu 


TAC GGC 
Tyr Gly 


TAC 
Tyr 


TAT 
Tyr 


GAA 
Glu 


724 

207 


725 
208 


AAT 
Asn 


GCA 
Ala 


TAT 
Tyr 


GGA 
Gly 


GAA 
Glu 


GGA 
Gly 


GGG 
Gly 


AAA 

Lys 


AGG TAT GCC TTA TGG GCT GCT AAA 
Arg Tyr Ala Leu Trp Ala Ala Lys 


ATG 
Met 


GCC 
Ala 


CTT TCT 
Leu Ser 


CAA 
Gin 


AAT 

Asn 


ACT 
Thr 


793 

230 


794 
231 


GGT 
Gly 


GTA 
Val 


CCT 
Pro 


TGG 
Trp 


ATA 

lie 


ATG 
Met 


TGC 
Cys 


CAG 

Gin 


CAG TAT GAT GCT CCT GAT CCT GTG 
Gin Tyr Asp Ala Pro Asp Pro Val 


ATT GAC 
He Asp 


ACA TGC 
Thr Cys 


AAT 

Asn 


TCA 
Ser 


TTT 
Phe 


862 

253 


863 
254 


TAC 
Tyr 


TGC 
Cys 


GAC 
Asp 


CAA 
Gin 


TTT 
Phe 


AAA 

Lys 


CCA 
Pro 


ATC 
He 


TCT 
Ser 


CCA AAC AAG CCC AAA ATT TGG 
Pro Asn Lys Pro Lys He Trp 


ACA 
Thr 


GAG 
Glu 


AAC TGG 
Asn Trp 


CCG 
Pro 


GGA 
Gly 


TGG 
Trp 


931 
276 


932 
277 


TTC 
Phe 


AAG 
Lys 


ACA 
Thr 


TTT 
Phe 


GGG 
Gly 


GCC 
Ala 


AGA 
Arg 


GAT 
Asp 


CCT 
Pro 


CAC AGG CCT GCA GAA GAT GTT 
His Arg Pro Ala Glu Asp Val 


GCT 
Ala 


TAT 
Tyr 


TCC GTG 
Ser Val 


GCT 
Ala 


CGT 
Arg 


TTT 
Phe 


1000 
299 


1001 
300 


TTC 
Phe 


CAA 
Gin 


AAA 

Lys 


GGA 
Gly 


GGA 
Gly 


AGC 
Ser 


GTG 
Val 


CAG 
Gin 


AAT TAT TAC ATG TAC CAT GGT GGG ACG 
Asn Tyr Tyr Met Tyr His Gly Gly Thr 


AAC 
Asn 


TTT GGC 
Phe Gly 


AGG 
Arg 


ACA 
Thr 


GCA 
Ala 


1069 
322 


1070 
323 


GGT 
Gly 


GGC 
Gly 


CCT 
Pro 


TTC 
Phe 


lie 


ACC 
Thr 


ACA 
Thr 


AGT 
Ser 


TAT GAC TAT GAT GCC CCA ATT GAC 
Tyr Asp Tyr Asp Ala Pro He Asp 


GAA 
Glu 


TAT 
Tyr 


GGT TTA 
Gly Leu 


CCA 
Pro 


AGG 
Arg 


TTT 
Phe 


1138 
' 345 


1139 
346 


CCA 
Pro 


AAA 
Lys 


TGG 
Trp 


GGT 
Gly 


CAC 
His 


CTT 
Leu 


AAA 
Lys 


GAA 
Glu 


CTT 
Leu 


CAT AAA GTC ATA AAA TCG TGT 
His Lys Val He Lys Ser Cys 


GAG 
Glu 


CAT 
His 


GCT CTG 
Ala Leu 


CTG 
Leu 


AAC 
Asn 


AAT 
Asn 


1207 
368 


1208 
369 


GAT 
ASp 


CCA 
Pro 


ACT 
Thr 


CTT 
Leu 


CTT 
Leu 


TCA 
Ser 


TTA 
Leu 


GGT 
Gly 


CCT 
Pro 


CTA CAA GAG GCT GAT GTT TAT GAA GAT 
Leu Gin Glu Ala Asp Val Tyr. Glu Asp 


GCT TCA 
Ala Ser 


GGC 
Gly 


GCT 
Ala 


TGT 
Cys 


1276 
391 


1277 
392 


GCT 
Ala 


GCC 
Ala 


TTT 
Phe 


CTC 
Leu 


GCG 
Ala 


AAT 
Asn 


ATG 
Met 


GAT 
Asp 


GAC AAA AAT GAC AAG GTG GTA CAG 
Asp Lys Asn Asp Lys Val Val Gin 


TTC 
Phe 


CGA 
Arg 


CAT GTA 
His Val 


TCA 
Ser 


TAC 
Tyr 


CAC 
His 


1345 
414 


1346 
415 


TTG 
Leu 


CCA 
Pro 


GCA 
Ala 


TGG 
Trp 


TCT 
Ser 


GTT 
Val 


AGC 
Ser 


ATT 
He 


TTG 
Leu 


CCA GAC TGC AAA AAT GTA GCG 
Pro Asp Cys Lys Asn Val Ala 


TTC 
Phe 


AAC 
Asn 


ACA GCA 
Thr Ala 


AAG 
Lys 


GTT 
Val 


GGA 
Gly 


1414 
437 


1415 
438 


TGT 
Cys 


CAA 
Gin 


ACT 
Thr 


TCT 
Ser 


ATT 

lie 


GTC 
val 


AAT 
Asn 


ATG 
Met 


GCA 
Ala 


CCC ATA GAT TTG CAT CCC ACC 
Pro He Asp Leu His Pro Thr 


GCA 
Ala 


AGT 
Ser 


TCA CCA 
Ser Pro 


AAG 

Lys 


AGA 
Arg 


GAC 
ASp 


1483 
460 
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;e xd number 7 coat . 



1484 ATC AAG 
461 lie Lys 


TCT CTT 
Ser Leu 


CAG 
Gin 


TGG 
Trp 


GAA 
Glu 


GTC 
Val 


TTC 
Phe 


AAG 
Lys 


GAA 
Glu 


ACA 
Thr 


GCT 
Ala 


GGA 

Gly 


GTA 
Val 


TGG GGA 
Trp Gly 


GTT 

val 


GCT 
Ala 


GAT 
ASp 


TTC 

Phe 


ACT 

Thr 


AAA 
Lys 


1552 
483 


1553 AAC GGA 
484 Asn Gly 


TTT GTA 
Phe Val 


GAT 
Asp 


CAC 
His 


ATT 
He 


AAC 
Asn 


ACC 
Thr 


ACA 
Thr 


AAA 

Lys 


GAT 
Asp 


GCT 
Ala 


ACA 
Thr 


GAC 
Asp 


TAC 
Tyr 


CTC 
Leu 


TGG 
Trp 


TAC 
Tyr 


ACA 
Thr 


ACA 
Thr 


AGT 
Ser 


ATT 
He 


1621 
506 


1622 TTT GTT 
507 Phe Val 


CAT GCA 
His Ala 


GAG 
Glu 


GAG 
Glu 


GAT 
ASp 


TTC 
Phe 


CTA 
Leu 


AGA 
Arg 


AAC 
Asn 


AGA 
Arg 


GGC 
Gly 


ACT 
Thr 


GCA 
Ala 


ATG 
Met 


CTT 
Leu 


TTC 
Phe 


GTT 
Val 


GAA 
Glu 


TCA 
Ser 


AAG 
Lys 


GGT 
Gly 


1690 
529 


1691 CAT GCT 
530 His Ala 


ATG CAT 
Met His 


GTC 
Val 


TTC 
Phe 


ATC 
He 


AAT 
Asn 


AAA 
Lys 


AAG 
Lys 


CTT 
Leu 


CAA 
Gin 


GCC 
Ala 


AGT 
Ser 


GCA 
Ala 


TCT GGA 
Ser Gly 


AAT 
Asn 


GGC 
Gly 


ACA 
Thr 


GTG 
Val 


CCA 
Pro 


CAG 
Gin 


1759 
552 


1760 TTC AAG 
553 Phe Lys 


TTT GGA ACT 
Phe Gly Thr 


CCT 
Pro 


ATT 
He 


GCT 
Ala 


CTA 
Leu 


AAG 
Lys 


GCA 
Ala 


GGG 
Gly 


AAG 
Lys 


AAT 
Asn 


GAA 
Glu 


ATT 
He 


TCC 
Ser 


TTG 
Leu 


TTA 
Leu 


AGC 
Ser 


ATG 
Met 


ACT 
Thr 


GTG 
Val 


1826 
575 


1829 GGC CTA 
576 Gly Leu 


CAA ACA GCT GGA GOG 
Gin Thr Ala Gly Ala 


TTT 
Phe 


TAT 

Tyr 


GAA 
Glu 


TGG 
Trp 


ATT 
He 


GGA 
Gly 


GCT 
Ala 


GGT 
Gly 


CCA ACA 
Pro Thr 


AGT 
Ser 


GTC 
Val 


AAA 

Lys 


GTT 
Val 


GCA 
Ala 


GGG 
Gly 


1897 
598 


1898 TTC AAG ACT GGG ACT ATG GAC 
599 Phe Lys Thr Gly Thr Met Asp 


TTG 
Leu 


ACT 

Thr 


GCG 
Ala 


TCT 
Ser 


GCT 
Ala 


TGG 
Trp 


ACC 
Thr 


TAT 
Tyr 


AAG 
Lys 


ATT 
He 


GGA 
Gly 


TTG 
Leu 


CAA 
Gin 


GGA 
Gly 


GAA 
Glu 


CAT 
His 


1966 
621 


1967 TTG AGG 
622 Leu Arg 


ATA CAG 
He Gin 


AAG 
Lys 


TCA 

Ser 


TAT 
Tyr 


AAC 
Asn 


TTG 
Leu 


AAG 
Lys 


AGT 
Ser 


AAA 

Lys 


ATT 
He 


TGG 
Trp 


GCA 
Ala 


CCA 
Pro 


ACT 
Thr 


TCG 
Ser 


CAG 
Gin 


CCA 
Pro 


CCA 
Pro 


AAG 

Lys 


CAA 
Gin 


2035 
644 


2036 CAG CCC 
645 Gin Pro 


CTC ACA 
Leu Thr 


TGG 
Trp 


TAT 
Tyr 


AAG 
Lys 


GCA 
Ala 


GTA 
Val 


GTA 
Val 


GAT 
Asp 


GCG 
Ala 


CCT 
Pro 


CCT 
Pro 


GGT 
Gly 


AAT 
Asn 


GAA 
Glu 


CCT 
Pro 


GTT 
Val 


GCA 
Ala 


CTT 
Leu 


GAT 
Asp 


ATG 
Met 


2104 
667 


2105 ATT CAT ATG GGA AAA 
668 lie His Met Gly Lys 


GGA ATG 
Gly Met 


GCT TGG 
Ala Trp 


TTG 
Leu 


AAT 
Asn 


GGA 
Gly 


CAA 
Gin 


GAA 
Glu 


ATT 
He 


GGC AGA 
Gly Arg 


TAT 
Tyr 


TGG 
Trp 


CCG 
Pro 


AGG 
Arg 


AGA 
Arg 


ACT 
Thr 


2173 
690 


2174 TCT AAA TAT GAG AAT TGT GTT 
691 Ser Lys Tyr Glu Asn Cys Val 


ACT 
Thr 


CAA 
Gin 


TGT 
Cys 


GAC 
Asp 


TAC 
Tyr 


AGA 
Arg 


GGC 
Gly 


AAA 
Lys 


TTT 
Phe 


AAC 
Asn 


CCT 
Pro 


GAT 
Asp 


AAG 

Lys 


TGT 
Cys 


GTC 

Val 


ACT 
Thr 


2242 
713 


2243 GGC TGT GGA CAA 
714 Gly Cys Gly Gin 


CCT 
Pro 


ACA 
Thr 


CAG 
Gin 


AGA TGG 

Arg Trp 


TAT 
Tyr 


CAT 
His 


GTG 
Val 


CCA 
Pro 


CGA 
Arg 


TCT 
Ser 


TGG 
Trp 


TTC 
Phe 


AAG 
Lys 


CCA 
Pro 


TCA 
Ser 


GGA 
Gly 


AAT 
Asn 


GTC 
Val 


2311 
736 


2312 TTA ATT 
737 Leu lie 


ATC TTT 
He Phe 


GAG 
Glu 


GAA 
Glu 


ATA 

He 


GGT GGA 
Gly Gly 


GAT 
Asp 


CCC 
Pro 


TCT 
Ser 


CAA 
Gin 
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Figure 4. Autoradiograph of northern blot analysis of TBG expression in various 
plant tissues. Twenty |ig of total RNA extracted from flowers (F), leaves (L), roots (R) 
and stems (S) was loaded in each lane. RNAs were separated in an agarose gel and 
transferred to nylon membrane. Blots were hybridized using the probes indicated to the 
right, washed to a final stringency of 0.1 X SSC at 65°C and were used to expose x-ray 
film. A 26S ribosomal gene clone from soybean was used as a loading control for each 
blot and one example is shown. 
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Figure 5. Autoradiograph of northern blot analysis of TBG expression in fruit 
tissues. Twenty ^tg of total RNA extracted from peel and outer pericarp tissue was 
ioaded in each lane. Fruit were harvested at 10, 20, 30, 35, and 40 days post- 
pollenation and at the breaker (Br), turning (Tr), pink (Pk) t red (Rd) and over ripe (OR) 
stages. RNAs were separated in an agarose gel and transferred to nylon membrane. 
Blots were hybridized using the probes indicated to the right, washed to a final strin- 
gency of 0.1X SSC at 65°C and were used to expose x-ray film. A 26S ribosomal gene 
clone from soybean was used as a loading control for each blot and one example is 
shown. A cDNA clone for tomato polygalacturonase (PG) was also used as a probe to 
show a well characterized, fruit-ripening-specific control. 
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Figure 6. Autoradiograph of northern blot analyse of TBG express.or, , m frurt 
tissues. Twenty ug of total RNA extracted from mature green or turn.ng stage frurt peel 
(P), outer pericarp (OP), inner pericarp (IP) and locular (L) tissue was loaded m each 
lane RNAs were separated in an agarose gel and transferred to nylon membrane. Blots 
were hybridized using the probes indicated to the right, washed to a final stnngency of 
0 1X SSC at 65°C and were used to expose x-ray film. A 26S ribosomal gene clone 
from soybean was used as a loading control for each blot and one example is shown. A 
cDNA clone for tomato polygalacturonase (PG) was also used as a probe to show a 
well characterized, fruit-ripening-specific control. 
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Figure 7 Autoradiograph of northern blot analysis of TBG expression in normal 
and mutant fruit tissues, twenty fig of total RNA extracted from peel and outer peri- 
carp tissue at various days post-poliination (dpp) was loaded in each lane. RNAs were 
separated in an agarose gel and transferred to nylon membrane. Blots were hybndized 
using the probes indicated to the right, washed to a final stringency of 0.1 X SSC at 
65»C and were used to expose x-ray film. A 26S ribosomal gene clone from soybean 
was used as a loading control for each blot and one example is shown. A cDNA clone 
for tomato polygalacturonase (PG) was also used as a probe to show a well character- 
ized, fruit-ripening-specific control. The - and > marks on the left indicate the position of 
the tomato 27S and 18S rRNAs respectively. 
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Figure 8. Autoradiograph of northern blot analysis of TBG expression in response 
to ethylene treatment of mature green fruit tissues. Twenty iig of total RNA ex- 
tracted from peel and outer pericarp tissue at various times (1. 2. 12 and 24 hours) after 
treatment with (+) or without (-) 10 ppm ethylene was loaded in each lane. RNAs were 
separated in an agarose gel and transferred to nylon membrane. Blots were hybridized 
using the probes indicated to the right, washed to a final stringency of 0.1 X sso at 
65°C and were used to expose x-ray film. A 26S ribosomal gene clone from soybean 
was used as a loading control for each blot and one example is shown. The - marks on 
the left indicate the position of the tomato 27S rRNA. 
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Figure 9. Western blot analysis of TBG4 expression by yeast. A yeast clone was 
isolated that secreted high levels of FLAG-TBG4 fusion protein into the culture ^medium. 
Protein samples were separated in an 8% acrylamide gel. transferred to mtroc ulose 
and were blotted with M1 anti-FLAG primary antibody. Blots were washed an ' b^ted 
with an alkaline-phosphatase conjugated secondary anybody and alkaline phosphatase 
activity was detected using Sigma Fast substrate. Lane 1 , culture medium of an 
untransformed yeast clone was used as a negative control. Lane 2, culU.re medium of 
yeast clone expressing FLAG-TBG4 fusion protein. Lane 3, Affinity punfied FLAG-TBG4 
fusion protein. 
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Figure 12. Northern blot analysis of TBG4 expression in transgenic fruit contain- 
ing TBG4 antisense construct A. Total RNA was extracted from mat ^ e J^ e ^ 
days post-pollenation (MG), turning/breaker + 3 (TR) and red/breaker + 7 (RD) fruft and 
twenty was loaded in each lane. RNAs were separated ,n an agarose get trans- 
ferred to nylon membrane. Blots were hybridized using the probes .nd.cated to the right, 
washed to a final stringency of 0.1X SSC at 65°C and were used to exp os e x-ray -fflm. 
A 26S ribosomal gene clone from soybean was used as a loading control The marks - 
and { denote the positions of the endogenous TBG4 and antisense mRNAs respec- 
tively Lanes 2-8 correspond to transgenic lines 2-8 in Figures 1 1A-E. B. Chart of TBG4 
mRNA levels in lines 2-8. Autoradiographs were scanned using a te" s *™* e < ™* 
TBG4 mRNA levels were corrected against the loading controls. TBG4 mRNA levels 
are shown in arbitrary units. 
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